The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09/039,177 




Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT WITH A NOTICE TO COMPLY or, 
2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is 
Y2K-compliant. Checker allows public users to check sequence listings in Computer Readable Form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 
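
As an illustration of the kind of consistency check a listing validator can perform, the following minimal Python sketch verifies the running base count printed at the end of nucleotide lines. It is a hypothetical example only: it is not the Checker program, and the function name `check_running_counts`, the accepted alphabet, and the input layout (whitespace-separated blocks with an optional trailing count) are assumptions made for this sketch.

```python
# Hypothetical helper, not part of the USPTO Checker program.
# Assumed alphabet: IUPAC nucleotide codes.
VALID_BASES = set("ACGTUNRYKMSWBDHV")


def check_running_counts(lines):
    """Return (line_index, expected, found) for every line with a problem.

    A line that contains an unexpected character is reported as
    (line_index, None, None) and is not added to the running total.
    """
    problems = []
    total = 0
    for index, line in enumerate(lines):
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        # A trailing all-digit token is treated as the cumulative count.
        found = int(tokens.pop()) if tokens[-1].isdigit() else None
        bases = "".join(tokens).upper()
        if not set(bases) <= VALID_BASES:
            problems.append((index, None, None))
            continue
        total += len(bases)
        if found is not None and found != total:
            problems.append((index, total, found))
    return problems


# First 60 bases of SEQ ID NO: 1 as laid out in the raw listing.
listing = [
    "AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA",
    "CGCTGGAATA 60",
]
print(check_running_counts(listing))
```

An empty result means every printed count agrees with the number of bases read so far; each mismatch is reported with the expected and the printed value.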

Checker Version 3.0 can be downloaded from the USPTO website at the following address: 

http://www.uspto.gov/web/offices/pac/checker 
RAW SEQUENCE LISTING DATE: 10/15/2001 

PATENT APPLICATION: US/09/039,177D TIME: 13:47:03 

Input Set : A:\Lud-5539 sequence listing.txt 
Output Set: N:\CRF3\10152001\I039177D.raw 

SEQUENCE LISTING 
3 (1) GENERAL INFORMATION: 



5 (i) APPLICANT: Kohei MIYAZONO; Takeshi IMAMURA; Peter TEN DIJKE 

7 (ii) TITLE OF INVENTION: ISOLATED ALK-1 PROTEIN, NUCLEIC ACIDS ENCODING 

8 IT, AND USES THEREOF 
10 (iii) NUMBER OF SEQUENCES: 46 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: Fulbright & Jaworski L.L.P. 

14 (B) STREET: 801 Pennsylvania Avenue, N.W. 

15 (C) CITY: Washington 

16 (D) STATE: District of Columbia 

17 (E) COUNTRY: USA 

18 (F) ZIP: 20004 

Does Not Comply 
Corrected Diskette Needed 

20 (v) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Diskette, 3.25 inch, 1.44mb 

22 (B) COMPUTER: IBM PS/2 

23 (C) OPERATING SYSTEM: PC-DOS 

24 (D) SOFTWARE: WordPerfect IV 
26 (vi) CURRENT APPLICATION DATA: 

C--> 27 (A) APPLICATION NUMBER: US/09/039,177D 

C--> 28 (B) FILING DATE: 13-Mar-1998 

29 (C) CLASSIFICATION: 435 

59 (vii) PRIOR APPLICATION DATA: 

32 (A) APPLICATION NUMBER: PCT/GB93/02367 

33 (B) FILING DATE: November 17, 1993 

36 (A) APPLICATION NUMBER: GB 9224057.1 

37 (B) FILING DATE: November 17, 1992 

40 (A) APPLICATION NUMBER: GB 9304677.9 

41 (B) FILING DATE: March 8, 1993 

44 (A) APPLICATION NUMBER: GB 9304680.3 

45 (B) FILING DATE: March 8, 1993 

48 (A) APPLICATION NUMBER: 9311047.6 

49 (B) FILING DATE: May 28, 1993 

52 (A) APPLICATION NUMBER: 9313763.6 

53 (B) FILING DATE: July 2, 1993 

56 (A) APPLICATION NUMBER: 9136099.2 

57 (B) FILING DATE: August 3, 1993 

60 (A) APPLICATION NUMBER: 321344.5 

61 (B) FILING DATE: October 15, 1993 

63 (viii) ATTORNEY/AGENT INFORMATION: 

64 (A) NAME: Mary Anne Schofield 

65 (B) REGISTRATION NUMBER: 36,669 

66 (C) REFERENCE/DOCKET NUMBER: LUD 5539 CIP - JEL/MAS 
68 (ix) TELECOMMUNICATION INFORMATION: 

69 (A) TELEPHONE: (202) 662-0200 

70 (B) TELEFAX: (202) 662-4643 
ERRORED SEQUENCES 



E--> 



E--> 

E--> 

E--> 
W--> 



74 (2) INFORMATION FOR SEQ ID NO: 1: 

75 (i) SEQUENCE CHARACTERISTICS: 

76 (A) LENGTH: 1984 base pairs 

77 (B) TYPE: nucleic acid 

78 (C) STRANDEDNESS: unknown 

79 (D) TOPOLOGY: linear 
81 (ii) MOLECULE TYPE: cDNA 
83 (iii) HYPOTHETICAL: NO 

85 (iv) ANTI-SENSE: NO 

87 (v) FRAGMENT TYPE: internal 

89 (vi) ORIGINAL SOURCE: 

90 (A) ORGANISM: Homo sapiens 

92 (ix) FEATURE: 

93 (A) NAME/KEY: CDS 

94 (B) LOCATION: 283..1791 

96 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

98 AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA 

99 CGCTGGAATA 60 

Does Not Comply 
Corrected Diskette Needed 

101 AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC 

102 CAGCTGCGCC 120 

104 GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC 

105 AGCCCGCCGT 180 

107 CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC 

108 GGTCCGCCGA 240 

110 AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CC 

111 294 
112 
113 
114 
115 
116 
118 



ATG GCC 
Met Ala 

15 
CTG GTG 




TCC CCC AGG AAA GGC CTT CTG ATG CTG CTG 

Ser Pro Arg Lys Gly Leu Leu Met Leu Leu 

5 10 

GGA GAC CCT GTG AAG CCG TCT CGG GGC CCG 

119 Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val 

120 25 30 
122 GAG AGC CCA CAT TGC AAG GGG CCT ACC TGC CGG GGG 
123 Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly 
124 40 45 

126 GTA GTG CTG GTG CGG GAG GAG GGG AGG CAC CCC CAG 

127 Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gln Glu His Arg Gly 

128 55 60 65 

130 TGC GGG AAC TTG CAC AGG GAG CTC TGC AGG GGG CGC CCC ACC GAG TTC 

131 Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg Pro Thr Glu Phe 

132 70 75 80 

134 GTC AAC CAC TAC TGC TGC GAC AGC CAC CTC TGC AAC CAC AAC GTG TCC 

135 Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn His Asn Val Ser 

136 85 90 95 100 

138 CTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG CCG GGA ACA GAT 

139 Leu Val Leu Glu Ala Thr Gln Pro Pro Ser Glu Gln Pro Gly Thr Asp 



534 582 630 



140 105 110 115 






142 GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG GCC TTG CTG GCC CTG 678 

143 Gly Gln Leu Ala Leu Ile Leu Gly Pro Val Leu Ala Leu Leu Ala Leu 

144 120 125 130 








146 GTG GCC CTG GGT GTC CTG GGC CTG TGG CAT GTC CGA CGG AGG CAG GAG 726 

147 Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg Arg Arg Gln Glu 

148 135 140 145 










150 AAG CAG CGT GGC CTG CAC AGC GAG CTG GGA GAG TCC AGT CTC ATC CTG 774 

151 Lys Gln Arg Gly Leu His Ser Glu Leu Gly Glu Ser Ser Leu Ile Leu 

152 150 155 160 












154 AAA GCA TCT GAG CAG GGC GAC ACG ATG TTG GGG GAC CTC CTG GAC AGT 822 

155 Lys Ala Ser Glu Gln Gly Asp Thr Met Leu Gly Asp Leu Leu Asp Ser 

156 165 170 175 180 




158 GAC TGC ACC ACA GGG AGT GGC TCA GGG CTC CCC TTC CTG GTG CAG AGG 870 

159 Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu Val Gln Arg 

160 185 190 195 






162 ACA GTG GCA CGG CAG GTT GCC TTG GTG GAG TGT GTG GGA AAA GGC CGC 918 

163 Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly Lys Gly Arg 

164 200 205 210 








166 TAT GGC GAA GTG TGG CGG GGC TTG TGG CAC GGT GAG AGT GTG GCC GTC 966 

167 Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu Ser Val Ala Val 

168 215 220 225 










170 AAG ATC TTC TCC TCG AGG GAT GAA CAG TCC TGG TTC CGG GAG ACT GAG 1014 

171 Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg Glu Thr Glu 

172 230 235 240 












174 ATC TAT AAC ACA GTA TTG CTC AGA CAC GAC AAC ATC CTA GGC TTC ATC 1062 

175 Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu Gly Phe Ile 

176 245 250 255 260 




178 GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG CTG TGG CTC ATC 1110 

179 Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu Trp Leu Ile 

180 265 270 275 






182 ACG CAC TAC CAC GAG CAC GGC TCC CTC TAC GAC TTT CTG CAG AGA CAG 1158 

183 Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu Gln Arg Gln 

184 280 285 290 








186 ACG CTG GAG CCC CAT CTG GCT CTG AGG CTA GCT GTG TCC GCG GCA TGC 1206 

187 Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val Ser Ala Ala Cys 

188 295 300 305 










1 QO 
J. _? V 




V_ J. Vj 


czccz 


par 




PAP 






A rnp 


± 


\J\J J. 






czczc 


AAA 


PPA 


1 9 

_L Zi -J *± 


191 Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln Gly Lys Pro 

192 310 315 320 












194 GCC ATT GCC CAC CGC GAC TTC AAG AGC CGC AAT GTG CTG GTC AAG AGC 1302 

195 Ala Ile Ala His Arg Asp Phe Lys Ser Arg Asn Val Leu Val Lys Ser 

196 325 330 335 340 




198 AAC CTG CAG TGT TGC ATC GCC GAC CTG GGC CTG GCT GTG ATG CAC TCA 1350 

199 Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Met His Ser 

200 345 350 355 






202 CAG GGC AGC GAT TAC CTG GAC ATC GGC AAC AAC CCG AGA GTG GGC ACC 1398 

203 Gln Gly Ser Asp Tyr Leu Asp Ile Gly Asn Asn Pro Arg Val Gly Thr 

204 360 365 370 



E--> 
W--> 

W--> 



W- 



w- 



w--> 



w- 



w--> 



w--> 

E--> 

w--> 
w--> 



E--> 



E--> 



206 
207( 

208 
209 
211 
212 
213 
215 
216 
217 
219 
220 
221 
223 
224 
225 
227 
228 
229 
231 
232 
233 
235 
236 
237 
238 
'240 
241 
243 
244 
246 
247 
249 
329 
330 
331 
332 
333 
334 
336 
338 







TGC TTT 
Cys Phe 
390 
CTG TGG 
Leu Trp 
405 

TAT AGA 



375 

GAG 
Glu 



Glu 



CCA 



GAC ATG 



AAC CGG 
Asn Arg 



Leu 
455 
TGC 



CGG GAG 
Arg Glu Cys 

470 
ATC AAG AAG 
1782 

Ile Lys Lys Thr Leu 
485 

GTG ATT 
Val Ile 



ATG GCA CCC GAG GTG CTG GAC GAG CAG ATC CGC ACG GAC 

Met Ala Pro Glu Val Leu Asp Glu Gln Ile Arg Thr Asp 

380 385 








TCC TAC AAG TGG ACT GAC ATC TGG GCC TTT GGC CTG GTG 

Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe Gly Leu Val 

395 400 










ATT GCC CGC CGG ACC ATC GTG AAT GGC ATC GTG GAG GAC 

Ile Ala Arg Arg Thr Ile Val Asn Gly Ile Val Glu Asp 

410 415 420 


CCC TTC TAT GAT GTG GTG CCC AAT GAC CCC AGC TTT GAG 

Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu 

425 430 435 




AAG GTG GTG TGT GTG GAT CAG CAG ACC CCC ACC ATC CCT 

Lys Val Val Cys Val Asp Gln Gln Thr Pro Thr Ile Pro 

440 445 450 






GCT GCA GAC CCG GTC CTC TCA GGC CTA GCT CAG ATG ATG 

Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gln Met Met 

460 465 








TGG TAC CCA AAC CCC TCT GCC CGA CTC ACC GCG CTG CGG 

Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg 

475 480 










ACA CTA CAA AAA ATT AGC AAC AGT CCA GAG AAG CCT AAA 



Gln Lys Ile Ser Asn 
490 

TAGCCCAGGA GCACCTGATT 



Ser Pro Glu Lys Pro Lys 
495 500 
CCTTTCTGCC TGCAGGGGGC 



C--J(^340^^^v 



CAA 
Gln 

TGGGGGGGTG GGGGGCAGTG GATGGTGCCC TATCTGGGTA GAGGTAGTGT 
GAGTGTGGTG 1891 

TGTGCTGGGG ATGGGCAGCT GCGCCTGCCT GCTCGGCCCC CAGCCCACCC 
AGCCAAAAAT 1951 

ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA 
(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 



344 
345 
347 
348 
349 
351 



1494 1542 1590 1638 1686 1734 1831 1984 

(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104..1630 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



E--> 353^TCCGAGT2 

3543 CGGCTTGAAG 
E--> 356 
W--> 357 115 

358 
359 

361 GTG ATG ATT 

362 Val Met He 

363 5 

36 5 ATG GAA GAT 
366 Met Glu Asp 
367 

369 TGT GAA GGT 

370 Cys Glu Gly 
371 

37 3 TGC TTT TCC 

374 Cys Phe Ser 

375 55 

377 GGC TGC TTC 

378 Gly Cys Phe 

379 70 

381 CCG TCC CCT 

382 Pro Ser Pro 

383 85 

385 AGG AAC ATC 

386 Arg Asn He 
387 

389 ACA CAG AAT 

390 Thr Gin Asn 
391 

393 TTC GCA GTA 
Phe Ala Val 



kCC AGAGTGAGAG AAGCTCTGAA CGAGGGCACG 



;TGACC XaGAGCCTGC^TTAAGTTGTACA ATG GTA GAT GGA 




394 
395 



135 



397 TTT AAA AGG 

398 Phe Lys Arg 

399 150 

4 01 GGC ACT ATC 

402 Gly Thr He 

403 165 

4 05 GCA GAT TTA 

406 Ala Asp Leu 
407 

4 09 CCT TTT CTG 

410 Pro Phe Leu 
411 

413 TGT GTC GGG 
Cys Val Gly 
215 



CTT 
Leu 

GAG 
Glu 

CTC 
Leu 
40 
TCA 
Ser 

CAG 
Gin 

GGC 
Gly 

ACG 
Thr 

TTC 
Phe 
120 
TGT 
Cys 

CGC 
Arg 

GAA 
Glu 

TTG 
Leu 

GTA 
Val 
200 
AAA 
Lys 



CCT 
Pro 

AAG 
Lys 
25 
TCC 
Ser 



GTG 
Val 
10 
CCC 
Pro 

TGC 
Cys 



CTG AGC 
Leu Ser 

GTT TAT 
Val Tyr 



CAA 
Gin 

GCC 
Ala 
105 
CAC 
His 



GCT 
Ala 
90 
CAG 
Gin 

TTG 
Leu 



CTT TTA 
Leu Leu 

AAC CAA 
Asn Gin 



GGG 
Gly 

GAT 
Asp 
185 
CAA 
Gin 



CTC 
Leu 
170 
CAT 
His 

AGA 
Arg 



GGC AGG 
Gly Arg 



CTT 
Leu 

AAG 
Lys 

GGT 
Gly 

ATC 
He 

GAG 
Glu 
75 
GTG 
Val 

CTG 
Leu 

GAG 
Glu 

GCC 
Ala 

GAA 
Glu 
155 
ATC 
lie 

TCG 
Ser 

ACA 
Thr 

TAT 
Tyr 



414 
415 

417 GGG GAA AAT GTT GCC GTG AAG 



ATC 
He 

GTC 
Val 

AAT 
Asn 

AAC 
Asn 
60 
CAG 
Gin 

GAG 
Glu 

CCC 
Pro 

GTT 
Val 

TGC 
Cys 
140 
CGC 
Arg 

ACC 
Thr 

TGT 
Cys 

GTG 
Val 

GGT 
Gly 
220 
ATC 



ATG 
Met 

AAC 
Asn 

GAG 
Glu 
45 
GAT 
Asp 

GGA 
Gly 

TGC 
Cys 

ACT 
Thr 

GGC 
Gly 
125 
CTG 
Leu 

CTC 
Leu 

ACC 
Thr 

ACA 
Thr 

GCT 
Ala 
205 
GAG 
Glu 



ATT 
He 

CCC 
Pro 
30 
GAC 
Asp 

GGC 
Gly 

AAG 
Lys 

TGC 
Cys 

AAA 
Lys 
110 
CTC 
Leu 

CTG 
Leu 

AAT 
Asn 

AAT 
Asn 

TCA 
Ser 
190 
CGC 
Arg 

GTG 
Val 



GCT 
Ala 
15 
AAA 
Lys 

CAC 
His 

TTC 
Phe 

ATG 
Met 

CAA 
Gin 
95 
GGA 
Gly 

ATT 
He 

GGA 
Gly 

CCC 
Pro 

GTT 
Val 
175 
GGA 
Gly 

CAG 
Gin 

TGG 
Trp 



CTC 
Leu 

CTC 
Leu 

TGT 
Cys 

CAC 
His 

ACC 
Thr 
80 
GGG 
Gly 

AAA 
Lys 

ATT 
He 

GTT 
Val 

CGA 
Arg 
160 
GGA 
Gly 

AGT 
Ser 

ATT 
He 

AGG 
Arg 



TTC TCC TCC CGT 



Met 
1 

CCC 
Pro 

TAC 
Tyr 

GAA 
Glu 

GTC 
Val 
65 
TGT 
Cys 

GAC 
Asp 

TCC 
Ser 

CTC 
Leu 

GCT 
Ala 
145 
GAC 
Asp 

GAC 
Asp 

GGC 
Gly 

ACA 
Thr 

GGC 
Gly 
225 
GAT 



Val Asp Gly 



TCC 
Ser 

ATG 
Met 

GGC 
Gly 
50 
TAC 
Tyr 



CCT 
Pro 

TGT 
Cys 
35 
CAG 
Gin 



AGT 
Ser 
20 
GTG 
Val 

CAG 
Gin 



CAG AAA 
Gin Lys 



AAG ACC CCG 
Lys Thr Pro 



TGG 
Trp 

TTC 
Phe 

TCT 
Ser 
130 
CTC 
Leu 



TGT 
Cys 

CCT 
Pro 
115 
GTA 
Val 



AAC 
Asn 
100 
GGA 
Gly 

GTG 
Val 



CGA AAA 
Arg Lys 



GTG GAG TAT 
Val Glu Tyr 



AGC 
Ser 

TCT 
Ser 

CTG 
Leu 
210 
AGC 
Ser 



ACT 
Thr 

GGT 
Gly 
195 
TTG 
Leu 



TTA 
Leu 
180 
CTT 
Leu 

GAG 
Glu 



TGG CAA 
Trp Gin 



GAG AAG TCA 



163 211 259 307 355 403 451 499 547 595 643 691 739 787 835 
418 Gly Glu Asn Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Lys Ser 

419 230 235 240 














421 TGG TTC AGG GAA ACG GAA TTG TAC AAC ACT GTG ATG CTG AGG CAT GAA 

422 Trp Phe Arg Glu Thr Glu Leu Tyr Asn Thr Val Met Leu Arg His Glu 

423 245 250 255 260 






425 AAT ATC TTA GGT TTC ATT GCT TCA GAC ATG ACA TCA AGA CAC TCC AGT 

426 Asn Ile Leu Gly Phe Ile Ala Ser Asp Met Thr Ser Arg His Ser Ser 

427 265 270 275 








429 ACC CAG CTG TGG TTA ATT ACA CAT TAT CAT GAA ATG GGA TCG TTG TAC 

430 Thr Gln Leu Trp Leu Ile Thr His Tyr His Glu Met Gly Ser Leu Tyr 

431 280 285 290 










433 GAC TAT CTT CAG CTT ACT ACT CTG GAT ACA GTT AGC TGC CTT CGA ATA 

434 Asp Tyr Leu Gln Leu Thr Thr Leu Asp Thr Val Ser Cys Leu Arg Ile 

435 295 300 305 












437 GTG CTG TCC ATA GCT AGT GGT CTT GCA CAT TTG CAC ATA GAG ATA TTT 

438 Val Leu Ser Ile Ala Ser Gly Leu Ala His Leu His Ile Glu Ile Phe 

439 310 315 320 










E--> 441 ACC CAA GGG AAA CCA GCC ATT GCC CAT CGA GAT TTA AAG AGC AAA 

W--> 442 

443 Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Lys 

W--> 444 AiTB^QTG 330 335 340 






446 


AAT 


GTT 


AAG 


AAG 


AAT 


GGA 


CAG 


TGT 


TGC 


ATA 


GCA 


GAT 


TTG 


GGC 


w- 


-> 


447 
448 


Asn 


lie 


Leu 


Val 


Lvs 
345 


Lvs 


Asn 




GUI 


CVs 
350 


TTVs" 


"Tlbr 


Ala- 




L eu 
355 


Gly^ 






450 CTG GCA GTC ATG CAT TCC CAG AGC ACC AAT CAG CTT GAT GTG GGG AAC 

451 Leu Ala Val Met His Ser Gln Ser Thr Asn Gln Leu Asp Val Gly Asn 

W--> 452 360 365 370 










454 AAT CCC CGT GTG GGC ACC AAG CGC TAC ATG GCC CCC GAA GTT CTA GAT 

455 Asn Pro Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp 

W--> 456 375 380 385 












458 GAA ACC ATC CAG GTG GAT TGT TTC GAT TCT TAT AAA AGG GTC GAT ATT 

459 Glu Thr Ile Gln Val Asp Cys Phe Asp Ser Tyr Lys Arg Val Asp Ile 

W--> 460 390 395 400 














462 TGG GCC TTT GGA CTT GTT TTG TGG GAA GTG GCC AGG CGG ATG GTG AGC 

463 Trp Ala Phe Gly Leu Val Leu Trp Glu Val Ala Arg Arg Met Val Ser 

W--> 464 405 410 415 420 






466 AAT GGT ATA GTG GAG GAT TAC AAG CCA CCG TTC TAC GAT GTG GTT CCC 

467 Asn Gly Ile Val Glu Asp Tyr Lys Pro Pro Phe Tyr Asp Val Val Pro 

W--> 468 425 430 435 








470 AAT GAC CCA AGT TTT GAA GAT ATG AGG AAG GTA GTC TGT GTG GAT CAA 

471 Asn Asp Pro Ser Phe Glu Asp Met Arg Lys Val Val Cys Val Asp Gln 

W--> 472 440 445 450 










474 CAA AGG CCA AAC ATA CCC AAC AGA TGG TTC TCA GAC CCG ACA TTA ACC 

475 Gln Arg Pro Asn Ile Pro Asn Arg Trp Phe Ser Asp Pro Thr Leu Thr 

W--> 476 455 460 465 












478 TCT CTG GCC AAG CTA ATG AAA GAA TGC TGG TAT CAA AAT CCA TCC GCA 

479 Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gln Asn Pro Ser Ala 

W--> 480 470 475 480 














482 AGA CTC ACA GCA CTG CGT ATC AAA AAG ACT TTG ACC AAA ATT GAT AAT 



883 931 979 1027 1075 1267 1315 1363 1411 1459 1507 1555 1603 
483 Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Thr Lys Ile Asp Asn 

W--> 484 485 490 495 500 

486 TCC CTC GAC AAA TTG AAA ACT GAC TGT TGACATTTTC ATAGTGTCAA 1650 

487 Ser Leu Asp Lys Leu Lys Thr Asp Cys 
W--> 488 505 

E--> 49Jy^€AAGGi^^^t- I ^TQ^sGG^iXGT TGTCATTGTC CAGCTGGGAC CTAATGCTGG 
491 CCTGACTGGT 1710 




E--> TCTrannBTC canT^nTPT^JcTrTrrPTrr CCAAATGGCT GCTTTGACAA 

494 GGCAGACGTC 1770 

E--> 496 GTACCCAGCC ATGTGTTGGG GAGACATCAA AACCACCCTA ACCTCGCTCG 

497 ATGACTGTGA 1830 

E--> 499 ACTGGGCATT TCACGAACTG TTCACACTGC AGAGACTAAT GTTGGACAGA 

500 CACTGTTGCA 1890 

E--> 502 AAGGTAGGGA CTGGAGGAAC ACAGAGAAAT CCTAAAAGAG ATCTGGGCAT 

503 TAAGTCAGTG 1950 

E--> 505 GCTTTGCATA GCTTTCACAA GTCTCCTAGA CACTCCCCAC GGGAAACTCA 

506 AGGAGGTGGT 2010 

E--> 508 GAATTTTTAA TCAGCAATAT TGCCTGTGCT TCTCTTCTTT ATTGCACTAG 

509 GAATTCTTTG 2070 

E--> 511 CATTCCTTAC TTGCACTGTT ACTCTTAATT TTAAAGACCC AACTTGCCAA 

512 AATGTTGGCT 2130 

E--> 514 GCGTACTCCA CTGGTCTGTC TTTGGATAAT AGGAATTCAA TTTGGCAAAA 

515 CAAAATGTAA 2190 

E--> 517 TGTCAGACTT TGCTGCATTT TACACATGTG CTGATGTTTA CAATGATGCC 

518 GAACATTAGG 2250 

E--> 520 AATTGTTTAT ACACAACTTT GCAAATTATT TATTACTTGT GCACTTAGTA 

521 GTTTTTACAA 2310 

E--> 523 AACTGCTTTG TGCATATGTT AAAGCTTATT TTTATGTGGT CTTATGATTT 

524 TATTACAGAA 2370 

E--> 526 ATGTTTTTAA CACTATACTC TAAAATGGAC ATTTTCTTTT ATTATCAGTT 

527 AAAATCACAT 2430 

E--> 529 TTTAAGTGCT TCACATTTGT ATGTGTGTAG ACTGTAACTT TTTTTCAGTT 

530 CATATGCAGA 2490 

E--> 532 ACGTATTTAG CCATTACCCA CGTGACACCA CCGAATATAT TATCGATTTA 

533 GAAGCAAAGA 2550 

E--> 535 TTTCAGTAGA ATTTTAGTCC TGAACGCTAC GGGGAAAATG CATTTTCTTC 

536 AGAATTATCC 2610 

E--> 538 ATTACGTGCA TTTAAACTCT GCCAGAAAAA AATAACTATT TTGTTTTAAT 

539 CTACTTTTTG 2670 

E--> 541 TATTTAGTAG /TTATTTGTM AAATTAAA^A" AACTGTTTTG^AAGTCAAAAA^AAAA 
W--> 542 2724 

621 (2) INFORMATION FOR SEQ ID NO: 5: 

622 (i) SEQUENCE CHARACTERISTICS: 

623 (A) LENGTH: 2932 base pairs 

624 (B) TYPE: nucleic acid 

625 (C) STRANDEDNESS: unknown 
626 (D) TOPOLOGY: linear 

628 (ii) MOLECULE TYPE: cDNA 

630 (iii) HYPOTHETICAL: NO 
632 (iv) ANTI-SENSE: NO 

634 (v) FRAGMENT TYPE: internal 

636 (vi) ORIGINAL SOURCE: 

637 (A) ORGANISM: Homo sapiens 

639 (ix) FEATURE: 

640 (A) NAME/KEY: CDS 

641 (B) LOCATION: 310..1905 

643 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 














645 GCTCCGCGCC GAGGGCTGGA GGATGCGTTC CCTGGGGTCC GGACTTATGA 

646 AAATATGCAT 60 

648 CAGTTTAATA CTGTCTTGGA ATTCATGAGA TGGAAGCATA GGTCAAAGCT 

649 GTTTGGAGAA 120 

651 AATCAGAAGT ACAGTTTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA 

652 GCAGTGGGAG 180 

654 TTGAAGTCAT TGTCAAGTGC TTGCGATCTT TTACAAGAAA ATCTCACTGA 

655 ATGATAGTCA 240 

657 TTTAAATTGG TGAAGTAGCA AGACCAATTA TTAAAGGTGA CAGTACACAG 

658 GAAACATTAC 300 

660 AATTGAACA ATG ACT CAG CTA TAC ATT TAC ATC AGA TTA TTG GGA GCC 348 

661 Met Thr Gln Leu Tyr Ile Tyr Ile Arg Leu Leu Gly Ala 

662 1 5 10 










664 TAT TTG TTC ATC ATT TCT CGT GTT CAA GGA CAG AAT CTG GAT AGT ATG 396 

665 Tyr Leu Phe Ile Ile Ser Arg Val Gln Gly Gln Asn Leu Asp Ser Met 

666 15 20 25 












668 CTT CAT GGC ACT GGG ATG AAA TCA GAC TCC GAC CAG AAA AAG TCA GAA 444 

669 Leu His Gly Thr Gly Met Lys Ser Asp Ser Asp Gln Lys Lys Ser Glu 

670 30 35 40 45 




672 AAT GGA GTA ACC TTA GCA CCA GAG GAT ACC TTG CCT TTT TTA AAG TGC 492 

673 Asn Gly Val Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys 

674 50 55 60 






676 TAT TGC TCA GGG CAC TGT CCA GAT GAT GCT ATT AAT AAC ACA TGC ATA 540 

677 Tyr Cys Ser Gly His Cys Pro Asp Asp Ala Ile Asn Asn Thr Cys Ile 

678 65 70 75 








680 ACT AAT GGA CAT TGC TTT GCC ATC ATA GAA GAA GAT GAC CAG GGA GAA 588 

681 Thr Asn Gly His Cys Phe Ala Ile Ile Glu Glu Asp Asp Gln Gly Glu 

682 80 85 90 










684 ACC ACA TTA GCT TCA GGG TGT ATG AAA TAT GAA GGA TCT GAT TTT CAG 636 

685 Thr Thr Leu Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phe Gln 

686 95 100 105 












688 TGC AAA GAT TCT CCA AAA GCC CAG CTA CGC CGG ACA ATA GAA TGT TGT 684 

689 Cys Lys Asp Ser Pro Lys Ala Gln Leu Arg Arg Thr Ile Glu Cys Cys 

690 110 115 120 125 




692 CGG ACC AAT TTA TGT AAC CAG TAT TTG CAA CCC ACA CTG CCC CCT GTT 732 

693 Arg Thr Asn Leu Cys Asn Gln Tyr Leu Gln Pro Thr Leu Pro Pro Val 

694 130 135 140 






696 GTC ATA GGT CCG TTT TTT GAT GGC AGC ATT CGA TGG CTG GTT TTG CTC 780 

697 Val Ile Gly Pro Phe Phe Asp Gly Ser Ile Arg Trp Leu Val Leu Leu 

698 145 150 155 








700 ATT TCT ATG GCT GTC TGC ATA ATT GCT ATG ATC ATC TTC TCC AGC TGC 828 

701 Ile Ser Met Ala Val Cys Ile Ile Ala Met Ile Ile Phe Ser Ser Cys 

702 160 165 170 














704 TTT TGT TAC AAA CAT TAT TGC AAG AGC ATC TCA AGC AGA CGT CGT TAC 876 

705 Phe Cys Tyr Lys His Tyr Cys Lys Ser Ile Ser Ser Arg Arg Arg Tyr 

706 175 180 185 
















708 AAT CGT GAT TTG GAA CAG GAT GAA GCA TTT ATT CCA GTT GGA GAA TCA 924 

709 Asn Arg Asp Leu Glu Gln Asp Glu Ala Phe Ile Pro Val Gly Glu Ser 

710 190 195 200 205 








712 CTA AAA GAC CTT ATT GAC CAG TCA CAA AGT TCT GGT AGT GGG TCT GGA 972 

713 Leu Lys Asp Leu Ile Asp Gln Ser Gln Ser Ser Gly Ser Gly Ser Gly 

714 210 215 220 










716 CTA CCT TTA TTG GTT CAG CGA ACT ATT GCC AAA CAG ATT CAG ATG GTC 1020 

717 Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met Val 

718 225 230 235 








E--> 720 CGG CAA GTT GGT AAA GGC CGA TAT GGA GAA GTA TGG ATG GGC AAA TGG 

W--> 721 1068 

722 Arg Gln Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp 

W--> 723 240 245 250 














725 CGT GGC GAA AAA GTG GCG GTG AAA GTA TTC TTT ACC ACT GAA GAA GCC 1116 

726 Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala 

W--> 727 255 260 265 
















729 AGC TGG TTT CGA GAA ACA GAA ATC TAC CAA ACT GTG CTA ATG CGC CAT 1164
730 Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg His
W--> 731 270 275 280 285








733 GAA AAC ATA CTT GGT TTC ATA GCG GCA GAC ATT AAA GGT ACA GGT TCC 1212
734 Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly Ser
W--> 735 290 295 300










737 TGG ACT CAG CTC TAT TTG ATT ACT GAT TAC CAT GAA AAT GGA TCT CTC 1260
738 Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser Leu
W--> 739 305 310 315












741 TAT GAC TTC CTG AAA TGT GCT ACA CTG GAC ACC AGA GCC CTG CTT AAA 1308
742 Tyr Asp Phe Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys
W--> 743 320 325 330














745 TTG GCT TAT TCA GCT GCC TGT GGT CTG TGC CAC CTG CAC ACA GAA ATT 1356
746 Leu Ala Tyr Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu Ile
W--> 747 335 340 345
















749 TAT GGC ACC CAA GGA AAG CCC GCA ATT GCT CAT CGA GAC CTA AAG AGC 1404
750 Tyr Gly Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser
W--> 751 350 355 360 365








753 AAA AAC ATC CTC ATC AAG AAA AAT GGG AGT TGC TGC ATT GCT GAC CTG 1452
754 Lys Asn Ile Leu Ile Lys Lys Asn Gly Ser Cys Cys Ile Ala Asp Leu
W--> 755 370 375 380










757 GGC CTT GCT GTT AAA TTC AAC AGT GAC ACA AAT GAA GTT GAT GTG CCC 1500
758 Gly Leu Ala Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp Val Pro
W--> 759 385 390 395












761 TTG AAT ACC AGG GTG GGC ACC AAA CGC TAC ATG GCT CCC GAA GTG CTG 1548
762 Leu Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu
W--> 763 400 405 410














765 GAC GAA AGC CTG AAC AAA AAC CAC TTC CAG CCC TAC ATC ATG GCT GAC 1596
766 Asp Glu Ser Leu Asn Lys Asn His Phe Gln Pro Tyr Ile Met Ala Asp
W--> 767 415 420 425














769 ATC TAC AGC TTC GGC CTA ATC ATT TGG GAG ATG GCT CGT CGT TGT ATC 1644
770 Ile Tyr Ser Phe Gly Leu Ile Ile Trp Glu Met Ala Arg Arg Cys Ile
W--> 771 430 435 440 445






773 ACA GGA GGG ATC GTG GAA GAA TAC CAA TTG CCA TAT TAC AAC ATG GTA 1692
774 Thr Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr Tyr Asn Met Val
W--> 775 450 455 460








777 CCG AGT GAT CCG TCA TAC GAA GAT ATG CGT GAG GTT GTG TGT GTC AAA 1740
778 Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys
W--> 779 465 470 475










781 CGT TTG CGG CCA ATT GTG TCT AAT CGG TGG AAC AGT GAT GAA TGT CTA 1788
782 Arg Leu Arg Pro Ile Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu
W--> 783 480 485 490












785 CGA GCA GTT TTG AAG CTA ATG TCA GAA TGC TGG GCC CAC AAT CCA GCC 1836
786 Arg Ala Val Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala
W--> 787 495 500 505














789 TCC AGA CTC ACA GCA TTG AGA ATT AAG AAG ACG CTT GCC AAG ATG GTT 1884
790 Ser Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ala Lys Met Val
W--> 791 510 515 520 525


E--> 793 GAA TCC CAA GAT GTA AAA ATC TGATGGTTAA ACCATCGGAG GAGAAACTCT
W--> 794 1935
795 Glu Ser Gln Asp Val Lys Ile
W--> 796 530


























E--> 798 AGACTGCAAG AACTGTTTTT ACCCATGGCA TGGGTGGAAT TAGAGTGGAA
799 TAAGGATGTT 1995


























E--> 801 AACTTGGTTC TCAGACTCTT TCTTCACTAC GTGTTCACAG GCTGCTAATA
802 TTAAACCTTT 2055


























E--> 804 CAGTACTCTT ATTAGGATAC AAGCTGGGAA CTTCTAAACA CTTCATTCTT
805 TATATATGGA 2115


























E--> 807 CAGCTTTATT TTAAATGTGG TTTTTGATGC CTTTTTTTAA GTGGGTTTTT
808 ATGAACTGCA 2175


























E--> 810 TCAAGACTTC AATCCTGATT AGTGTCTCCA GTCAAGCTCT GGGTACTGAA
811 TTGCCTGTTC 2235


























E--> 813 ATAAAACGGT GCTTTCTGTG AAAGCCTTAA GAAGATAAAT GAGCGCAGCA
814 GAGATGGAGA 2295


























E--> 816 AATAGACTTT GCCTTTTACC TGAGACATTC AGTTCGTTTG TATTCTACCT
817 TTGTAAAACA 2355


























E--> 819 GCCTATAGAT GATGATGTGT TTGGGATACT GCTTATTTTA TGATAGTTTG
820 TCCTGTGTCC 2415


























E--> 822 TTAGTGATGT GTGTGTGTCT CCATGCACAT GCACGCCGGG ATTCCTCTGC
823 TGCCATTTGA 2475


























E--> 825 ATTAGAAGAA AATAATTTAT ATGCATGCAC AGGAAGATAT TGGTGGCCGG
826 TGGTTTTGTG 2535


























E--> 828 CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT
829 TTACTTTGCA 2595


























E--> 831 AGTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC
832 CAAGGCCAAA 2655


























E--> 834 AGAAGTTTAA AGCATCTGTA AATTTGGACT GTTTTCCTTC AACCACCATT







1644 1692 1740 1788 1836 1884
835 TTTTTTGTGG 2715


























E--> 837 TTATTATTTT TGTCACGGAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT
838 GCCATGAACC 2775


























E--> 840 ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT GCATTTGATA
841 GCAATGTAAG 2835


























E--> 843 TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGGG
844 AAGTTATTTA 2895




























846 TATTTTGTGT ATAATGTGCT TTATTTGCAA ATCACCC 2932




929 (2) INFORMATION FOR SEQ ID NO: 7
930 (i) SEQUENCE CHARACTERISTICS:
931 (A) LENGTH: 2333 base pairs
932 (B) TYPE: nucleic acid
933 (C) STRANDEDNESS: unknown
934 (D) TOPOLOGY: linear
936 (ii) MOLECULE TYPE: cDNA
938 (iii) HYPOTHETICAL: NO
C--> 940 (iv) ANTI-SENSE: NO
942 (v) FRAGMENT TYPE: internal
944 (vi) ORIGINAL SOURCE:
945 (A) ORGANISM: Homo sapiens
947 (ix) FEATURE:
948 (A) NAME/KEY: CDS
949 (B) LOCATION: 1..1515
951 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
















953 ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT GTT GTC CTC 48
954 Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu
955 1 5 10 15








957 CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG GGG GTC CAG GCT CTG 96
958 Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gln Ala Leu
959 20 25 30










961 CTG TGT GCG TGC ACC AGC TGC CTC CAG GCC AAC TAC ACG TGT GAG ACA 144
962 Leu Cys Ala Cys Thr Ser Cys Leu Gln Ala Asn Tyr Thr Cys Glu Thr
963 35 40 45












965 GAT GGG GCC TGC ATG GTT TCC TTT TTC AAT CTG GAT GGG ATG GAG CAC 192
966 Asp Gly Ala Cys Met Val Ser Phe Phe Asn Leu Asp Gly Met Glu His
967 50 55 60














969 CAT GTG CGC ACC TGC ATC CCC AAA GTG GAG CTG GTC CCT GCC GGG AAG 240
970 His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro Ala Gly Lys
971 65 70 75 80






973 CCC TTC TAC TGC CTG AGC TCG GAG GAC CTG CGC AAC ACC CAC TGC TGC 288
974 Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys
975 85 90 95








977 TAC ACT GAC TAC TGC AAC AGG ATC GAC TTG AGG GTG CCC AGT GGT CAC 336
978 Tyr Thr Asp Tyr Cys Asn Arg Ile Asp Leu Arg Val Pro Ser Gly His
979 100 105 110










981 CTC AAG GAG CCT GAG CAC CCG TCC ATG TGG GGC CCG GTG GAG CTG GTA 384
982 Leu Lys Glu Pro Glu His Pro Ser Met Trp Gly Pro Val Glu Leu Val
983 115 120 125












985 GGC ATC ATC GCC GGC CCG GTG TTC CTC CTG TTC CTC ATC ATC ATC ATT 432
986 Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile Ile Ile Ile
987 130 135 140

989 GTT TTC CTT GTC ATT AAC TAT CAT CAG CGT GTC TAT CAC AAC CGC CAG 480
990 Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His Asn Arg Gln
991 145 150 155 160

993 AGA CTG GAC ATG GAA GAT CCC TCA TGT GAG ATG TGT CTC TCC AAA GAC 528
994 Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp
995 165 170 175

997 AAG ACG CTC CAG GAT CTT GTC TAC GAT CTC TCC ACC TCA GGG TCT GGC 576
998 Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly
999 180 185 190

1001 TCA GGG TTA CCC CTC TTT GTC CAG CGC ACA GTG GCC CGA ACC ATC GTT 624
1002 Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg Thr Ile Val
1003 195 200 205

1005 TTA CAA GAG ATT ATT GGC AAG GGT CGG TTT GGG GAA GTA TGG CGG GGC 672
1006 Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly
1007 210 215 220

1009 CGC TGG AGG GGT GGT GAT GTG GCT GTG AAA ATA TTC TCT TCT CGT GAA 720
1010 Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser Ser Arg Glu
1011 225 230 235 240

1013 GAA CGG TCT TGG TTC AGG GAA GCA GAG ATA TAC CAG ACG GTC ATG CTG 768
1014 Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr Val Met Leu
1015 245 250 255

1016 CGC CAT GAA AAC ATC CTT GGA TTT ATT GCT GCT GAC AAT AAA GAT AAT 816
1017 Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn Lys Asp Asn
1018 260 265 270

1020 GGC ACC TGG ACA CAG CTG TGG CTT GTT TCT GAC TAT CAT GAG CAC GGG 864
1021 Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His Glu His Gly
1022 275 280 285

1024 TCC CTG TTT GAT TAT CTG AAC CGG TAC ACA GTG ACA ATT GAG GGG ATG 912
1025 Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile Glu Gly Met
1026 290 295 300

1028 ATT AAG CTG GCC TTG TCT GCT GCT AGT GGG CTG GCA CAC CTG CAC ATG 960
1029 Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His Leu His Met
1030 305 310 315 320

1032 GAG ATC GTG GGC ACC CAA GGG AAG CCT GGA ATT GCT CAT CGA GAC TTA 1008
1033 Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His Arg Asp Leu
1034 325 330 335

1036 AAG TCA AAG AAC ATT CTG GTG AAG AAA AAT GGC ATG TGT GCC ATA GCA 1056
1037 Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys Ala Ile Ala
1038 340 345 350

1040 GAC CTG GGC CTG GCT GTC CGT CAT GAT GCA GTC ACT GAC ACC ATT GAC 1104
1041 Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr Ile Asp
1042 355 360 365

1044 ATT GCC CCG AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG GCC CCT GAA 1152
1045 Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu
1046 370 375 380

1048 GTA CTT GAT GAA ACC ATT AAT ATG AAA CAC TTT GAC TCC TTT AAA TGT 1200
1049 Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser Phe Lys Cys
1050 385 390 395 400

1052 GCT GAT ATT TAT GCC CTC GGG CTT GTA TAT TGG GAG ATT GCT CGA AGA 1248
1053 Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile Ala Arg Arg
1054 405 410 415

1056 TGC AAT TCT GGA GGA GTC CAT GAA GAA TAT CAG CTG CCA TAT TAC GAC 1296
1057 Cys Asn Ser Gly Gly Val His Glu Glu Tyr Gln Leu Pro Tyr Tyr Asp
1058 420 425 430

1060 TTA GTG CCC TCT GAC CCT TCC ATT GAG GAA ATG CGA AAG GTT GTA TGT 1344
1061 Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys Val Val Cys
1062 435 440 445

1064 GAT CAG AAG CTG CGT CCC AAC ATC CCC AAC TGG TGG CAG AGT TAT GAG 1392
1065 Asp Gln Lys Leu Arg Pro Asn Ile Pro Asn Trp Trp Gln Ser Tyr Glu
1066 450 455 460

1068 GCA CTG CGG GTG ATG GGG AAG ATG ATG CGA GAG TGT TGG TAT GCC AAC 1440
1069 Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn
1070 465 470 475 480

1072 GGC GCA GCC CGC CTG ACG GCC CTG CGC ATC AAG AAG ACC CTC TCC CAG 1488
1073 Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr Leu Ser Gln
1074 485 490 495

1076 CTC AGC GTG CAG GAA GAC GTG AAG ATC TAACTGCTCC CTCTCTCCAC 1535
1077 Leu Ser Val Gln Glu Asp Val Lys Ile
1078 500 505

E--> 1080 ACGGAGCTCC TGGCAGCGAG AACTACGCAC AGCTGCCGCG TTGAGCGTAC 

1081 GATGGAGGCC 1595 
E--> 1083 TACCTCTCGT TTCTGCCCAG CCCTCTGTGG CCAGGAGCCC TGGCCCGCAA 

1084 GAGGGACAGA 1655
E--> 1086 GCCCGGGAGA GACTCGCTCA CTCCCATGTT GGGTTTGAGA CAGACACCTT 

1087 TTCTATTTAC 1715 
E--> 1089 CTCCTAATGG CATGGAGACT CTGAGAGCGA ATTGTGTGGA GAACTCAGTG 

1090 CCACACCTCG 1775 
E--> 1092 AACTGGTTGT AGTGGGAAGT CCCGCGAAAC CCGGTGCATC TGGCACGTGG 

1093 CCAGGAGCCA 1835
E--> 1095 TGACAGGGGC GCTTGGGAGG GGCCGGAGGA ACCGAGGTGT TGCCAGTGCT 

1096 AAGCTGCCCT 1895 
E--> 1098 GAGGGTTTCC TTCGGGGACC AGCCCACAGC ACACCAAGGT GGCCCGGAAG 

1099 AACCAGAAGT 1955 
E--> 1101 GCAGCCCCTC TCACAGGCAG CTCTGAGCCG CGCTTTCCCC TCCTCCCTGG 

1102 GATGGACGCT 2015

E--> 1104 GCCGGGAGAC TGCCAGTGGA GACGGAATCT GCCGCTTTGT CTGTCCAGCC 

1105 GTGTGTGCAT 2075 
E--> 1107 GTGCCGAGGT GCGTCCCCCG TTGTGCCTGG TTCGTGCCAT GCCCTTACAC 

1108 GTGCGTGTGA 2135 
E--> 1110 GTGTGTGTGT GTGTCTGTAG GTGCGCACTT ACCTGCTTGA GCTTTCTGTG 

1111 CATGTGCAGG 2195 
E--> 1113 TCGGGGGTGT GGTCGTCATG CTGTCCGTGC TTGCTGGTGC CTCTTTTCAG 

1114 TAGTGAGCAG 2255
E--> 1116 CATCTAGTTT CCCTGGTGCC CTTCCCTGGA GGTCTCTCCC TCCCCCAGAG 

1117 CCCCTCATGC 2315

1119 CACAGTGGTA CTCTGTGT 2333
1199 (2) INFORMATION FOR SEQ ID NO: 9:
1200 (i) SEQUENCE CHARACTERISTICS:
1201 (A) LENGTH: 2308 base pairs
1202 (B) TYPE: nucleic acid
1203 (C) STRANDEDNESS: unknown
1204 (D) TOPOLOGY: linear
1206 (ii) MOLECULE TYPE: cDNA
1208 (iii) HYPOTHETICAL: NO
C--> 1210 (iv) ANTI-SENSE: NO
1212 (v) FRAGMENT TYPE: internal
1214 (vi) ORIGINAL SOURCE:
1215 (A) ORGANISM: Mouse
1217 (ix) FEATURE:
1218 (A) NAME/KEY: CDS
1219 (B) LOCATION: 77..1585
1221 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:












E--> 1223 GGCGAGGCGA GGTTTGCTGG GGTGAGGCAG CGGCGCGGCC GGGCCGGGCC
1224 GGGCCACAGG 60
























E--> 1226 CGGTGGCGGC GGGACC ATG GAG GCG GCG GTC GCT GCT CCG CGT CCC CGG
W--> 1227 109
1228 Met Glu Ala Ala Val Ala Ala Pro Arg Pro Arg
1229 1 5 10






1231 CTG CTC CTC CTC GTG CTG GCG GCG GCG GCG GCG GCG GCG GCG GCG CTG 157
1232 Leu Leu Leu Leu Val Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu
1233 15 20 25








1235 CTC CCG GGG GCG ACG GCG TTA CAG TGT TTC TGC CAC CTC TGT ACA AAA 205
1236 Leu Pro Gly Ala Thr Ala Leu Gln Cys Phe Cys His Leu Cys Thr Lys
1237 30 35 40










1239 GAC AAT TTT ACT TGT GTG ACA GAT GGG CTC TGC TTT GTC TCT GTC ACA 253
1240 Asp Asn Phe Thr Cys Val Thr Asp Gly Leu Cys Phe Val Ser Val Thr
1241 45 50 55












1243 GAG ACC ACA GAC AAA GTT ATA CAC AAC AGC ATG TGT ATA GCT GAA ATT 301
1244 Glu Thr Thr Asp Lys Val Ile His Asn Ser Met Cys Ile Ala Glu Ile
1245 60 65 70 75




1247 GAC TTA ATT CCT CGA GAT AGG CCG TTT GTA TGT GCA CCC TCT TCA AAA 349
1248 Asp Leu Ile Pro Arg Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys
1249 80 85 90






1251 ACT GGG TCT GTG ACT ACA ACA TAT TGC TGC AAT CAG GAC CAT TGC AAT 397
1252 Thr Gly Ser Val Thr Thr Thr Tyr Cys Cys Asn Gln Asp His Cys Asn
1253 95 100 105








1255 AAA ATA GAA CTT CCA ACT ACT GTA AAG TCA TCA CCT GGC CTT GGT CCT 445
1256 Lys Ile Glu Leu Pro Thr Thr Val Lys Ser Ser Pro Gly Leu Gly Pro
1257 110 115 120










1259 GTG GAA CTG GCA GCT GTC ATT GCT GGA CCA GTG TGC TTC GTC TGC ATC 493
1260 Val Glu Leu Ala Ala Val Ile Ala Gly Pro Val Cys Phe Val Cys Ile
1261 125 130 135












1263 TCA CTC ATG TTG ATG GTC TAT ATC TGC CAC AAC CGC ACT GTC ATT CAC 541
1264 Ser Leu Met Leu Met Val Tyr Ile Cys His Asn Arg Thr Val Ile His
1265 140 145 150 155



157 205 253 301 349 397 445 493 541
1267 CAT CGA GTG CCA AAT GAA GAG GAC CCT TCA TTA GAT CGC CCT TTT ATT 589
1268 His Arg Val Pro Asn Glu Glu Asp Pro Ser Leu Asp Arg Pro Phe Ile
1269 160 165 170






1271 TCA GAG GGT ACT ACG TTG AAA GAC TTA ATT TAT GAT ATG ACA ACG TCA 637
1272 Ser Glu Gly Thr Thr Leu Lys Asp Leu Ile Tyr Asp Met Thr Thr Ser
1273 175 180 185








1275 GGT TCT GGC TCA GGT TTA CCA TTG CTT GTT CAG AGA ACA ATT GCG AGA 685
1276 Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Arg
1277 190 195 200










1279 ACT ATT GTG TTA CAA GAA AGC ATT GGC AAA GGT CGA TTT GGA GAA GTT 733
1280 Thr Ile Val Leu Gln Glu Ser Ile Gly Lys Gly Arg Phe Gly Glu Val
1281 205 210 215












1283 TGG AGA GGA AAG TGG CGG GGA GAA GAA GTT GCT GTT AAG ATA TTC TCC 781
1284 Trp Arg Gly Lys Trp Arg Gly Glu Glu Val Ala Val Lys Ile Phe Ser
1285 220 225 230 235




1287 TCT AGA GAA GAA CGT TCG TGG TTC CGT GAG GCA GAG ATT TAT CAA ACT 829
1288 Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr
1289 240 245 250






1291 GTA ATG TTA CGT CAT GAA AAC ATC CTG GGA TTT ATA GCA GCA GAC AAT 877
1292 Val Met Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn
1293 255 260 265








1295 AAA GAC AAT GGT ACT TGG ACT CAG CTC TGG TTG GTG TCA GAT TAT CAT 925
1296 Lys Asp Asn Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His
1297 270 275 280










1299 GAG CAT GGA TCC CTT TTT GAT TAC TTA AAC AGA TAC ACA GTT ACT GTG 973
1300 Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val
1301 285 290 295












1303 GAA GGA ATG ATA AAA CTT GCT CTG TCC ACG GCG AGC GGT CTT GCC CAT 1021
1304 Glu Gly Met Ile Lys Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His
1305 300 305 310 315




1307 CTT CAC ATG GAG ATT GTT GGT ACC CAA GGA AAG CCA GCC ATT GCT CAT 1069
1308 Leu His Met Glu Ile Val Gly Thr Gln Gly Lys Pro Ala Ile Ala His
1309 320 325 330






1311 AGA GAT TTG AAA TCA AAG AAT ATC TTG GTA AAG AAG AAT GGA ACT TGC 1117
1312 Arg Asp Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys
1313 335 340 345








1315 TGT ATT GCA GAC TTA GGA CTG GCA GTA AGA CAT GAT TCA GCC ACA GAT 1165
1316 Cys Ile Ala Asp Leu Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp
1317 350 355 360










1319 ACC ATT GAT ATT GCT CCA AAC CAC AGA GTG GGA ACA AAA AGG TAC ATG 1213
1320 Thr Ile Asp Ile Ala Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met
1321 365 370 375












1323 GCC CCT GAA GTT CTC GAT GAT TCC ATA AAT ATG AAA CAT TTT GAA TCC 1261
1324 Ala Pro Glu Val Leu Asp Asp Ser Ile Asn Met Lys His Phe Glu Ser
1325 380 385 390 395




1327 TTC AAA CGT GCT GAC ATC TAT GCA ATG GGC TTA GTA TTC TGG GAA ATT 1309
1328 Phe Lys Arg Ala Asp Ile Tyr Ala Met Gly Leu Val Phe Trp Glu Ile
1329 400 405 410






1331 GCT CGA CGA TGT TCC ATT GGT GGA ATT CAT GAA GAT TAC CAA CTG CCT 1357
1332 Ala Arg Arg Cys Ser Ile Gly Gly Ile His Glu Asp Tyr Gln Leu Pro
1333 415 420 425

1335 TAT TAT GAT CTT GTA CCT TCT GAC CCA TCA GTT GAA GAA ATG AGA AAA 1405
1336 Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys
1337 430 435 440

1339 GTT GTT TGT GAA CAG AAG TTA AGG CCA AAT ATC CCA AAC AGA TGG CAG 1453
1340 Val Val Cys Glu Gln Lys Leu Arg Pro Asn Ile Pro Asn Arg Trp Gln
1341 445 450 455

1343 AGC TGT GAA GCC TTG AGA GTA ATG GCT AAA ATT ATG AGA GAA TGT TGG 1501
1344 Ser Cys Glu Ala Leu Arg Val Met Ala Lys Ile Met Arg Glu Cys Trp
1345 460 465 470 475

1347 TAT GCC AAT GGA GCA GCT AGG CTT ACA GCA TTG CGG ATT AAG AAA ACA 1549
1348 Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr
1349 480 485 490

1351 TTA TCG CAA CTC AGT CAA CAG GAA GGC ATC AAA ATG TAATTCTACA 1595
1352 Leu Ser Gln Leu Ser Gln Gln Glu Gly Ile Lys Met
1353 495 500

E--> 1355 GCTTTGCCTG AACTCTCCTT TTTTCTTCAG ATCTGCTCCT GGGTTTTAAT 

1356 TTGGGAGGTC 1655 

E--> 1358 AGTTGTTCTA CCTCACTGAG AGGGAACAGA AGGATATTGC TTCCTTTTGC 

1359 AGCAGTGTAA 1715 

E--> 1361 TAAAGTCAAT TAAAAACTTC CCAGGATTTC TTTGGACCCA GGAAACAGCC 

1362 ATGTGGGTCC 1775 

E--> 1364 TTTCTGTGCA CTATGAACGC TTCTTTCCCA GGACAGAAAA TGTGTAGTCT

1365 ACCTTTATTT 1835

E--> 1367 TTTATTAACA AAACTTGTTT TTTAAAAAGA TGATTGCTGG TCTTAACTTT 

1368 AGGTAACTCT 1895 

E--> 1370 GCTGTGCTGG AGATCATCTT TAAGGGCAAA GGAGTTGGAT TGCTGAATTA 

1371 CAATGAAACA 1955 

E--> 1373 TGTCTTATTA CTAAAGAAAG TGATTTACTC CTGGTTAGTA CATTCTCAGA

1374 GGATTCTGAA 2015

E--> 1376 CCACTAGAGT TTCCTTGATT CAGACTTTGA ATGTACTGTT CTATAGTTTT

1377 TCAGGATCTT 2075

E--> 1379 AAAACTAACA CTTATAAAAC TCTTATCTTG AGTCTAAAAA TGACCTCATA 

1380 TAGTAGTGAG 2135 

E--> 1382 GAACATAATT CATGCAATTG TATTTTGTAT ACTATTATTG TTCTTTCACT 

1383 TATTCAGAAC 2195 

E--> 1385 ATTACATGCC TTCAAAATGG GATTGTACTA TACCAGTAAG TGCCACTTCT 

1386 GTGTCTTTCT 2255

E--> 1388 AATGGAAATG AGTAGAATTG CTGAAAGTCT CTATGTTAAA ACCTATAGTG TTT

W--> 1389 2308 

1468 (2) INFORMATION FOR SEQ ID NO: 11:
1469 (i) SEQUENCE CHARACTERISTICS:
1470 (A) LENGTH: 1922 base pairs
1471 (B) TYPE: nucleic acid
1472 (C) STRANDEDNESS: unknown
1473 (D) TOPOLOGY: linear
1475 (ii) MOLECULE TYPE: cDNA
1477 (iii) HYPOTHETICAL: NO
C--> 1479 (iv) ANTI-SENSE: NO
1481 (v) FRAGMENT TYPE: internal
1483 (vi) ORIGINAL SOURCE:
1484 (A) ORGANISM: Mouse
1486 (ix) FEATURE:
1487 (A) NAME/KEY: CDS
1488 (B) LOCATION: 241..1746
1490 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:










E--> 1492 GAGAGCACAG CCCTTCCCAG TCCCCGGAGC CGCCGCGCCA CGCGCGCATG
1493 ATCAAGACCT 60
























E--> 1495 TTTCCCCGGC CCCACAGGGC CTCTGGACGT GAGACCCCGG CCGCCTCCGC
1496 AAGGAGAGGC 120
























E--> 1498 GGGGGTCGAG TCGCCCTGTC CAAAGGCCTC AATCTAAACA ATCTTGATTC
1499 CTGTTGCCGG 180
























E--> 1501 CTGGCGGGAC CCTGAATGGC AGGAAATCTC ACCACATCTC TTCTCCTATC
1502 TCCAAGGACC 240


























1504 ATG ACC TTG GGG AGC TTC AGA AGG GGC CTT TTG ATG CTG TCG GTG GCC 288
1505 Met Thr Leu Gly Ser Phe Arg Arg Gly Leu Leu Met Leu Ser Val Ala
1506 1 5 10 15






1508 TTG GGC CTA ACC CAG GGG AGA CTT GCG AAG CCT TCC AAG CTG GTG AAC 336
1509 Leu Gly Leu Thr Gln Gly Arg Leu Ala Lys Pro Ser Lys Leu Val Asn
1510 20 25 30








1512 TGC ACT TGT GAG AGC CCA CAC TGC AAG AGA CCA TTC TGC CAG GGG TCA 384
1513 Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cys Gln Gly Ser
1514 35 40 45










1516 TGG TGC ACA GTG GTG CTG GTT CGA GAG CAG GGC AGG CAC CCC CAG GTC 432
1517 Trp Cys Thr Val Val Leu Val Arg Glu Gln Gly Arg His Pro Gln Val
1518 50 55 60












1520 TAT CGG GGC TGT GGG AGC CTG AAC CAG GAG CTC TGC TTG GGA CGT CCC 480
1521 Tyr Arg Gly Cys Gly Ser Leu Asn Gln Glu Leu Cys Leu Gly Arg Pro
1522 65 70 75 80




1524 ACG GAG TTT CTG AAC CAT CAC TGC TGC TAT AGA TCC TTC TGC AAC CAC 528
1525 Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His
1526 85 90 95






1528 AAC GTG TCT CTG ATG CTG GAG GCC ACC CAA ACT CCT TCG GAG GAG CCA 576
1529 Asn Val Ser Leu Met Leu Glu Ala Thr Gln Thr Pro Ser Glu Glu Pro
1530 100 105 110








1532 GAA GTT GAT GCC CAT CTG CCT CTG ATC CTG GGT CCT GTG CTG GCC TTG 624
1533 Glu Val Asp Ala His Leu Pro Leu Ile Leu Gly Pro Val Leu Ala Leu
1534 115 120 125










1536 CCG GTC CTG GTG GCC CTG GGT GCT CTG GGC TTG TGG CGT GTC CGG CGG 672
1537 Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg
1538 130 135 140












1540 AGG CAG GAG AAG CAG CGG GAT TTG CAC AGT GAC CTG GGC GAG TCC AGT
1541 Arg Gln Glu Lys Gln Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser
1542 145 150 155 160

1544 CTC ATC CTG AAG GCA TCT GAA CAG GCA GAC AGC ATG TTG GGG GAC TTC
1545 Leu Ile Leu Lys Ala Ser Glu Gln Ala Asp Ser Met Leu Gly Asp Phe
1546 165 170 175

288 336 384 432 480 528 576 624 672 720 768




RAW SEQUENCE LISTING DATE: 10/15/2001 

PATENT APPLICATION: US/09/039,177D TIME: 13:47:04



Input Set : A:\Lud-5539 sequence listing.txt 
Output Set: N:\CRF3\10152001\I039177D.raw 



1548 CTG GAC AGC GAC TGT ACC ACG GGC AGC GGC TCG GGG CTC CCC TTC TTG 816
1549 Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu
1550 180 185 190

1552 GTG CAG AGG ACG GTA GCT CGG CAG GTT GCG CTG GTA GAG TGT GTG GGA 864
1553 Val Gln Arg Thr Val Ala Arg Gln Val Ala Leu Val Glu Cys Val Gly
1554 195 200 205

1556 AAG GGC CGA TAT GGC GAG GTG TGG CGC GGT TCG TGG CAT GGC GAA AGC 912
1557 Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp His Gly Glu Ser
1558 210 215 220

1560 GTG GCG GTC AAG ATT TTC TCC TCA CGA GAT GAG CAG TCC TGG TTC CGG 960
1561 Val Ala Val Lys Ile Phe Ser Ser Arg Asp Glu Gln Ser Trp Phe Arg
1562 225 230 235 240

1564 GAG ACG GAG ATC TAC AAC ACA GTT CTG CTT AGA CAC GAC AAC ATC CTA 1008
1565 Glu Thr Glu Ile Tyr Asn Thr Val Leu Leu Arg His Asp Asn Ile Leu
1566 245 250 255

1568 GGC TTC ATC GCC TCC GAC ATG ACT TCG CGG AAC TCG AGC ACG CAG CTG 1056
1569 Gly Phe Ile Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gln Leu
1570 260 265 270

1572 TGG CTC ATC ACC CAC TAC CAT GAA CAC GGC TCC CTC TAT GAC TTT CTG 1104
1573 Trp Leu Ile Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu
1574 275 280 285

1576 CAG AGG CAG ACG CTG GAG CCC CAG TTG GCC CTG AGG CTA GCT GTG TCC 1152
1577 Gln Arg Gln Thr Leu Glu Pro Gln Leu Ala Leu Arg Leu Ala Val Ser
1578 290 295 300

1580 CCG GCC TGC GGC CTG GCG CAC CTA CAT GTG GAG ATC TTT GGC ACT CAA 1200
1581 Pro Ala Cys Gly Leu Ala His Leu His Val Glu Ile Phe Gly Thr Gln
1582 305 310 315 320

1584 GGC AAA CCA GCC ATT GCC CAT CGT GAC CTC AAG AGT CGC AAT GTG CTG 1248
1585 Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys Ser Arg Asn Val Leu
1586 325 330 335

1588 GTC AAG AGT AAC TTG CAG TGT TGC ATT GCA GAC CTG GGA CTG GCT GTG 1296
1589 Val Lys Ser Asn Leu Gln Cys Cys Ile Ala Asp Leu Gly Leu Ala Val
1590 340 345 350

1592 ATG CAC TCA CAA AGC AAC GAG TAC CTG GAT ATC GGC AAC ACA CCC CGA 1344
1593 Met His Ser Gln Ser Asn Glu Tyr Leu Asp Ile Gly Asn Thr Pro Arg
1594 355 360 365

1596 GTG GGT ACC AAA AGA TAC ATG GCA CCC GAG GTG CTG GAT GAG CAC ATC 1392
1597 Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu His Ile
1598 370 375 380

1600 CGC ACA GAC TGC TTT GAG TCG TAC AAG TGG ACA GAC ATC TGG GCC TTT 1440
1601 Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp Ile Trp Ala Phe
1602 385 390 395 400

1604 GGC CTA GTG CTA TGG GAG ATC GCC CGG CGG ACC ATC ATC AAT GGC ATT 1488
1605 Gly Leu Val Leu Trp Glu Ile Ala Arg Arg Thr Ile Ile Asn Gly Ile
1606 405 410 415

1608 GTG GAG GAT TAC AGG CCA CCT TTC TAT GAC ATG GTA CCC AAT GAC CCC 1536
1609 Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro Asn Asp Pro
1610 420 425 430

1612 AGT TTT GAG GAC ATG AAA AAG GTG GTG TGC GTT GAC CAG CAG ACA CCC 1584
1613 Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gln Gln Thr Pro
1614 435 440 445

1616 ACC ATC CCT AAC CGG CTG GCT GCA GAT CCG GTC CTC TCC GGG CTG GCC 1632
1617 Thr Ile Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala
1618 450 455 460

1620 CAG ATG ATG AGA GAG TGC TGG TAC CCC AAC CCC TCT GCT CGC CTC ACC 1680
1621 Gln Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr
1622 465 470 475 480

1624 GCA CTG CGC ATA AAG AAG ACA TTG CAG AAG CTC AGT CAC AAT CCA GAG 1728
1625 Ala Leu Arg Ile Lys Lys Thr Leu Gln Lys Leu Ser His Asn Pro Glu
1626 485 490 495

1628 AAG CCC AAA GTG ATT CAC TAGCCCAGGG CCACCAGGCT TCCTCTGCCT 1776
1629 Lys Pro Lys Val Ile His
1630 500

E--> 1632 AAAGTGTGTG CTGGGGAAGA AGACATAGCC TGTCTGGGTA GAGGGAGTGA
1633 AGAGAGTGTG 1836
E--> 1635 CACGCTGCCC TGTGTGTGCC TGCTCAGCTT GCTCCCAGCC CATCCAGCCA
1636 AAAATACAGC 1896
1638 TGAGCTGAAA TTCAAAAAAA AAAAAA 1922

1716 (2) INFORMATION FOR SEQ ID NO: 13: 

1717 (i) SEQUENCE CHARACTERISTICS: 

1718 (A) LENGTH: 2070 base pairs 

1719 (B) TYPE: nucleic acid 

1720 (C) STRANDEDNESS: unknown

1721 (D) TOPOLOGY: linear 
1723 (ii) MOLECULE TYPE: cDNA 
1725 (iii) HYPOTHETICAL: NO

C--> 1727 (iv) ANTI-SENSE: NO 

1729 (v) FRAGMENT TYPE: internal 

1731 (vi) ORIGINAL SOURCE: 

1732 (A) ORGANISM: Mouse 

1734 (ix) FEATURE: 

1735 (A) NAME/KEY: CDS 

1736 (B) LOCATION: 217..1812

1738 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

E--> 1740 ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTCGGAGAA ATTGGAACTA
1741 CAGTTTTATC 60
E--> 1743 TAGCCACATC TCTGAGAATT CTGAAGAAAG CAGCAGGTGA AAGTCATTGC
1744 CAAGTGATTT 120
E--> 1746 TGTTCTGTAA GGAAGCCTCC CTCATTCACT TACACCAGTG AGACAGCAGG
1747 ACCAGTCATT 180
E--> 1749 CAAAGGGCCG TGTACAGGAC GCGTGGCAAT CAG ACA ATG ACT CAG CTA TAC ACT
W--> 1750 234
1751 Met Thr Gln Leu Tyr Thr
1752 1 5

1754 TAC ATC AGA TTA CTG GGA GCC TGT CTG TTC ATC ATT TCT CAT GTT CAA 282
1755 Tyr Ile Arg Leu Leu Gly Ala Cys Leu Phe Ile Ile Ser His Val Gln
1756 10 15 20

1758 GGG CAG AAT CTA GAT AGT ATG CTC CAT GGC ACT GGT ATG AAA TCA GAC 330
1759 Gly Gln Asn Leu Asp Ser Met Leu His Gly Thr Gly Met Lys Ser Asp
1760 25 30 35

1762 TTG GAC CAG AAG AAG CCA GAA AAT GGA GTG ACT TTA GCA CCA GAG GAT 378
1763 Leu Asp Gln Lys Lys Pro Glu Asn Gly Val Thr Leu Ala Pro Glu Asp
1764 40 45 50

1766 ACC TTG CCT TTC TTA AAG TGC TAT TGC TCA GGA CAC TGC CCA GAT GAT 426
1767 Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser Gly His Cys Pro Asp Asp
1768 55 60 65 70

1770 GCT ATT AAT AAC ACA TGC ATA ACT AAT GGC CAT TGC TTT GCC ATT ATA 474
1771 Ala Ile Asn Asn Thr Cys Ile Thr Asn Gly His Cys Phe Ala Ile Ile
1772 75 80 85

1774 GAA GAA GAT GAT CAG GGA GAA ACC ACA TTA ACT TCT GGG TGT ATG AAG 522
1775 Glu Glu Asp Asp Gln Gly Glu Thr Thr Leu Thr Ser Gly Cys Met Lys
1776 90 95 100

1778 TAT GAA GGC TCT GAT TTT CAA TGC AAG GAT TCA CCG AAA GCC CAG CTA 570
1779 Tyr Glu Gly Ser Asp Phe Gln Cys Lys Asp Ser Pro Lys Ala Gln Leu
1780 105 110 115

1782 CGC AGG ACA ATA GAA TGT TGT CGG ACC AAT TTG TGC AAC CAG TAT TTG 618
1783 Arg Arg Thr Ile Glu Cys Cys Arg Thr Asn Leu Cys Asn Gln Tyr Leu
1784 120 125 130

1786 CAG CCT ACA CTG CCC CCT GTT GTT ATA GGT CCG TTC TTT GAT GGC AGC 666
1787 Gln Pro Thr Leu Pro Pro Val Val Ile Gly Pro Phe Phe Asp Gly Ser
1788 135 140 145 150

1790 ATC CGA TGG CTG GTT GTG CTC ATT TCC ATG GCT GTC TGT ATA GTT GCT 714
1791 Ile Arg Trp Leu Val Val Leu Ile Ser Met Ala Val Cys Ile Val Ala
1792 155 160 165

1794 ATG ATC ATC TTC TCC AGC TGC TTT TGC TAT AAG CAT TAT TGT AAG AGT 762
1795 Met Ile Ile Phe Ser Ser Cys Phe Cys Tyr Lys His Tyr Cys Lys Ser
1796 170 175 180

1798 ATC TCA AGC AGG GGT CGT TAC AAC CGT GAT TTG GAA CAG GAT GAA GCA 810
1799 Ile Ser Ser Arg Gly Arg Tyr Asn Arg Asp Leu Glu Gln Asp Glu Ala
1800 185 190 195

1802 TTT ATT CCA GTA GGA GAA TCA TTG AAA GAC CTG ATT GAC CAG TCC CAA 858
1803 Phe Ile Pro Val Gly Glu Ser Leu Lys Asp Leu Ile Asp Gln Ser Gln
1804 200 205 210

1806 AGC TCT GGG AGT GGA TCT GGA TTG CCT TTA TTG GTT CAG CGA ACT ATT 906
1807 Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gln Arg Thr Ile
1808 215 220 225 230

1810 GCC AAA CAG ATT CAG ATG GTT CGG CAG GTT GGT AAA GGC CGC TAT GGA 954
1811 Ala Lys Gln Ile Gln Met Val Arg Gln Val Gly Lys Gly Arg Tyr Gly
1812 235 240 245

1814 GAA GTA TGG ATG GGT AAA TGG CGT GGT GAA AAA GTG GCT GTC AAA GTG 1002
1815 Glu Val Trp Met Gly Lys Trp Arg Gly Glu Lys Val Ala Val Lys Val
1816 250 255 260

1818 TTT TTT ACC ACT GAA GAA GCT AGC TGG TTT AGA GAA ACA GAA ATC TAC 1050
1819 Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr
1820 265 270 275

1822 CAG ACG GTG TTA ATG CGT CAT GAA AAT ATA CTT GGT TTT ATA GCT GCA 1098
1823 Gln Thr Val Leu Met Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala
1824 280 285 290

1826 GAC ATT AAA GGC ACT GGT TCC TGG ACT CAG CTG TAT TTG ATT ACT GAT 1146
1827 Asp Ile Lys Gly Thr Gly Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp
1828 295 300 305 310

1830 TAC CAT GAA AAT GGA TCT CTC TAT GAC TTC CTG AAA TGT GCC ACA CTA 1194
1831 Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe Leu Lys Cys Ala Thr Leu
1832 315 320 325

1834 GAC ACC AGA GCC CTA CTC AAG TTA GCT TAT TCT GCT GCT TGT GGT CTG 1242
1835 Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr Ser Ala Ala Cys Gly Leu
1836 330 335 340

1838 TGC CAC CTC CAC ACA GAA ATT TAT GGT ACC CAA GGG AAG CCT GCA ATT 1290
1839 Cys His Leu His Thr Glu Ile Tyr Gly Thr Gln Gly Lys Pro Ala Ile
1840 345 350 355

1842 GCT CAT CGA GAC CTG AAG AGC AAA AAC ATC CTT ATT AAG AAA AAT GGA 1338
1843 Ala His Arg Asp Leu Lys Ser Lys Asn Ile Leu Ile Lys Lys Asn Gly
1844 360 365 370

1846 AGT TGC TGT ATT GCT GAC CTG GGC CTA GCT GTT AAA TTC AAC AGT GAT 1386
1847 Ser Cys Cys Ile Ala Asp Leu Gly Leu Ala Val Lys Phe Asn Ser Asp
1848 375 380 385 390

1850 ACA AAT GAA GTT GAC ATA CCC TTG AAT ACC AGG GTG GGC ACC AAG CGG 1434
1851 Thr Asn Glu Val Asp Ile Pro Leu Asn Thr Arg Val Gly Thr Lys Arg
1852 395 400 405

1854 TAC ATG GCT CCA GAA GTG CTG GAT GAA AGC CTG AAT AAA AAC CAT TTC 1482
1855 Tyr Met Ala Pro Glu Val Leu Asp Glu Ser Leu Asn Lys Asn His Phe
1856 410 415 420

1858 CAG CCC TAC ATC ATG GCT GAC ATC TAT AGC TTT GGT TTG ATC ATT TGG 1530
1859 Gln Pro Tyr Ile Met Ala Asp Ile Tyr Ser Phe Gly Leu Ile Ile Trp
1860 425 430 435

1862 GAA ATG GCT CGT CGT TGT ATT ACA GGA GGA ATC GTG GAG GAA TAT CAA 1578
1863 Glu Met Ala Arg Arg Cys Ile Thr Gly Gly Ile Val Glu Glu Tyr Gln
1864 440 445 450

1866 TTA CCA TAT TAC AAC ATG GTG CCC AGT GAC CCA TCC TAT GAG GAC ATG 1626
1867 Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp Pro Ser Tyr Glu Asp Met
1868 455 460 465 470

1870 CGT GAG GTT GTG TGT GTG AAA CGC TTG CGG CCA ATC GTG TCT AAC CGC 1674
1871 Arg Glu Val Val Cys Val Lys Arg Leu Arg Pro Ile Val Ser Asn Arg
1872 475 480 485

1874 TGG AAC AGC GAT GAA TGT CTT CGA GCA GTT TTG AAG CTA ATG TCA GAA 1722
1875 Trp Asn Ser Asp Glu Cys Leu Arg Ala Val Leu Lys Leu Met Ser Glu
1876 490 495 500

1878 TGT TGG GCC CAT AAT CCA GCC TCC AGA CTC ACA GCT TTG AGA ATC AAG 1770
1879 Cys Trp Ala His Asn Pro Ala Ser Arg Leu Thr Ala Leu Arg Ile Lys
1880 505 510 515

1882 AAG ACA CTT GCA AAA ATG GTT GAA TCC CAG GAT GTA AAG ATT 1812
1883 Lys Thr Leu Ala Lys Met Val Glu Ser Gln Asp Val Lys Ile
1884 520 525 530

E--> 1886 TGACAATTAA ACAATTTTGA GGGAGAATTT AGACTGCAAG AACTTCTTCA
1887 CCCAAGGAAT 1872
E--> 1889 GGGTGGGATT AGCATGGAAT AGGATGTTGA CTTGGTTTCC AGACTCCTTC
1890 CTCTACATCT 1932
E--> 1892 TCACAGGCTG CTAACAGTAA ACCTTACCGT ACTCTACAGA ATACAAGATT 

1893 GGAACTTGGA 1992
E--> 1895 ACTTCAAACA TGTCATTCTT TATATATGAC AGCTTTGTTT TAATGTGGGG 

1896 TTTTTTTGTT 2052 

1898 TGCTTTTTTT GTTTTGTT 2070

1981 (2) INFORMATION FOR SEQ ID NO: 15: 

1982 (i) SEQUENCE CHARACTERISTICS: 

1983 (A) LENGTH: 2160 base pairs 

1984 (B) TYPE: nucleic acid 

1985 (C) STRANDEDNESS: unknown 

1986 (D) TOPOLOGY: linear 
1988 (ii) MOLECULE TYPE: cDNA 
1990 (iii) HYPOTHETICAL: NO 

C--> 1992 (iv) ANTI-SENSE: NO 

1994 (v) FRAGMENT TYPE: internal 

1996 (vi) ORIGINAL SOURCE: 

1997 (A) ORGANISM: Mouse 

1999 (ix) FEATURE: 

2000 (A) NAME/KEY: CDS 

2001 (B) LOCATION: 10..1524

2003 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

2005 CGCGGTTAC ATG GCG GAG TCG GCC GGA GCC TCC TCC TTC TTC CCC CTT 48
2006 Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu
2007 1 5 10

2009 GTT GTC CTC CTG CTC GCC GGC AGC GGC GGG TCC GGG CCC CGG GGG ATC 96
2010 Val Val Leu Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Ile
2011 15 20 25

2013 CAG GCT CTG CTG TGT GCG TGC ACC AGC TGC CTA CAG ACC AAC TAC ACC 144
2014 Gln Ala Leu Leu Cys Ala Cys Thr Ser Cys Leu Gln Thr Asn Tyr Thr
2015 30 35 40 45

2017 TGT GAG ACA GAT GGG GCT TGC ATG GTC TCC ATC TTT AAC CTG GAT GGC 192
2018 Cys Glu Thr Asp Gly Ala Cys Met Val Ser Ile Phe Asn Leu Asp Gly
2019 50 55 60

2021 GTG GAG CAC CAT GTA CGT ACC TGC ATC CCC AAG GTG GAG CTG GTT CCT 240
2022 Val Glu His His Val Arg Thr Cys Ile Pro Lys Val Glu Leu Val Pro
2023 65 70 75

2025 GCT GGA AAG CCC TTC TAC TGC CTG AGT TCA GAG GAT CTG CGC AAC ACA 288
2026 Ala Gly Lys Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr
2027 80 85 90

2029 CAC TGC TGC TAT ATT GAC TTC TGC AAC AAG ATT GAC CTC AGG GTC CCC 336
2030 His Cys Cys Tyr Ile Asp Phe Cys Asn Lys Ile Asp Leu Arg Val Pro
2031 95 100 105

2033 AGC GGA CAC CTC AAG GAG CCT GCG CAC CCC TCC ATG TGG GGC CCT GTG 384
2034 Ser Gly His Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val
2035 110 115 120 125

2037 GAG CTG GTC GGC ATC ATC GCC GGC CCC GTC TTC CTC CTC TTC CTT ATC 432
2038 Glu Leu Val Gly Ile Ile Ala Gly Pro Val Phe Leu Leu Phe Leu Ile
2039 130 135 140

2041 ATT ATC ATC GTC TTC CTG GTC ATC AAC TAT CAC CAG CGT GTC TAC CAT 480
2042 Ile Ile Ile Val Phe Leu Val Ile Asn Tyr His Gln Arg Val Tyr His
2043 145 150 155

2045 AAC CGC CAG AGG TTG GAC ATG GAG GAC CCC TCT TGC GAG ATG TGT CTC 528
2046 Asn Arg Gln Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu
2047 160 165 170

2049 TCC AAA GAC AAG ACG CTC CAG GAT CTC GTC TAC GAC CTC TCC ACG TCA 576
2050 Ser Lys Asp Lys Thr Leu Gln Asp Leu Val Tyr Asp Leu Ser Thr Ser
2051 175 180 185

2053 GGG TCT GGC TCA GGG TTA CCC CTT TTT GTC CAG CGC ACA GTG GCC CGA 624
2054 Gly Ser Gly Ser Gly Leu Pro Leu Phe Val Gln Arg Thr Val Ala Arg
2055 190 195 200 205

2057 ACC ATT GTT TTA CAA GAG ATT ATC GGC AAG GGC CGG TTC GGG GAA GTA 672
2058 Thr Ile Val Leu Gln Glu Ile Ile Gly Lys Gly Arg Phe Gly Glu Val
2059 210 215 220

2061 TGG CGT GGT CGC TGG AGG GGT GGT GAC GTG GCT GTG AAA ATC TTC TCT 720
2062 Trp Arg Gly Arg Trp Arg Gly Gly Asp Val Ala Val Lys Ile Phe Ser
2063 225 230 235

2065 TCT CGT GAA GAA CGG TCT TGG TTC CGT GAA GCA GAG ATC TAC CAG ACC 768
2066 Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu Ile Tyr Gln Thr
2067 240 245 250

2069 GTC ATG CTG CGC CAT GAA AAC ATC CTT GGC TTT ATT GCT GCT GAC AAT 816
2070 Val Met Leu Arg His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Asn
2071 255 260 265

2073 AAA GAT AAT GGC ACC TGG ACC CAG CTG TGG CTT GTC TCT GAC TAT CAC 864
2074 Lys Asp Asn Gly Thr Trp Thr Gln Leu Trp Leu Val Ser Asp Tyr His
2075 270 275 280 285

2077 GAG CAT GGC TCA CTG TTT GAT TAT CTG AAC CGC TAC ACA GTG ACC ATT 912
2078 Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Ile
2079 290 295 300

2081 GAG GGA ATG ATT AAG CTA GCC TTG TCT GCA GCC AGT GGT TTG GCA CAC 960
2082 Glu Gly Met Ile Lys Leu Ala Leu Ser Ala Ala Ser Gly Leu Ala His
2083 305 310 315

2085 CTG CAT ATG GAG ATT GTG GGC ACT CAA GGG AAG CCG GGA ATT GCT CAT 1008
2086 Leu His Met Glu Ile Val Gly Thr Gln Gly Lys Pro Gly Ile Ala His
2087 320 325 330

2089 CGA GAC TTG AAG TCA AAG AAC ATC CTG GTG AAA AAA AAT GGC ATG TGT 1056
2090 Arg Asp Leu Lys Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Met Cys
2091 335 340 345

2093 GCC ATT GCA GAC CTG GGC CTG GCT GTC CGT CAT GAT GCG GTC ACT GAC 1104
2094 Ala Ile Ala Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp
2095 350 355 360 365

2097 ACC ATA GAC ATT GCT CCA AAT CAG AGG GTG GGG ACC AAA CGA TAC ATG 1152
2098 Thr Ile Asp Ile Ala Pro Asn Gln Arg Val Gly Thr Lys Arg Tyr Met
2099 370 375 380

2101 GCT CCT GAA GTC CTT GAC GAG ACA ATC AAC ATG AAG CAC TTT GAC TCC 1200
2102 Ala Pro Glu Val Leu Asp Glu Thr Ile Asn Met Lys His Phe Asp Ser
2103 385 390 395

2105 TTC AAA TGT GCC GAC ATC TAT GCC CTC GGG CTT GTC TAC TGG GAG ATT 1248
2106 Phe Lys Cys Ala Asp Ile Tyr Ala Leu Gly Leu Val Tyr Trp Glu Ile
2107 400 405 410

2109 GCA CGA AGA TGC AAT TCT GGA GGA GTC CAT GAA GAC TAT CAA CTG CCG 1296
2110 Ala Arg Arg Cys Asn Ser Gly Gly Val His Glu Asp Tyr Gln Leu Pro
2111 415 420 425

2113 TAT TAC GAC TTA GTG CCC TCC GAC CCT TCC ATT GAG GAG ATG CGA AAG 1344
2114 Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Ile Glu Glu Met Arg Lys
2115 430 435 440 445

2117 GTT GTA TGT GAC CAG AAG CTA CGG CCC AAT GTC CCC AAC TGG TGG CAG 1392
2118 Val Val Cys Asp Gln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gln
2119 450 455 460

2121 AGT TAT GAG GCC TTG CGA GTG ATG GGA AAG ATG ATG CGG GAG TGC TGG 1440
2122 Ser Tyr Glu Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp
2123 465 470 475

2125 TAC GCC AAT GGT GCT GCC CGT CTG ACA GCT CTG CGC ATC AAG AAG ACT 1488
2126 Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg Ile Lys Lys Thr
2127 480 485 490

2129 CTG TCC CAG CTA AGC GTG CAG GAA GAT GTG AAG ATT TAAGCTGTTC 1534
2130 Leu Ser Gln Leu Ser Val Gln Glu Asp Val Lys Ile
2131 495 500 505

E--> 2133 CTCTGCCTAC ACAAAGAACC TGGGCAGTGA GGATGACTGC AGCCACCGTG 

2134 CAAGCGTCGT 1594

E--> 2136 GGAGGCCTAT CCTCTTGTTT CTGCCCGGCC CTCTGGCAGA GCCCTGGCCT

2137 GCAAGAGGGA 1654

E--> 2139 CAGAGCCTGG GAGACGCGCG CACTCCCGTT GGGTTTGAGA CAGACACTTT

2140 TTATATTTAC 1714

E--> 2142 CTCCTGATGG CATGGAGACC TGAGCAAATC ATGTAGTCAC TCAATGCCAC

2143 AACTCAAACT 1774

E--> 2145 GCTTCAGTGG GAAGTACAGA GACCCAGTGC ATTGCGTGTG CAGGAGCGTG

2146 AGGTGCTGGG 1834

E--> 2148 CTCGCCAGGA GCGGCCCCCA TACCTTGTGG TCCACTGGGC TGCAGGTTTT

2149 CCTCCAGGGA 1894

E--> 2151 CCAGTCAACT GGCATCAAGA TATTGAGAGG AACCGGAAGT TTCTCCCTCC

2152 TTCCCGTAGC 1954

E--> 2154 AGTCCTGAGC CACACCATCC TTCTCATGGA CATCCGGAGG ACTGCCCCTA

2155 GAGACACAAC 2014

E--> 2157 CTGCTGCCTG TCTGTCCAGC CAAGTGCGCA TGTGCCGAGG TGTGTCCCAC

2158 ATTGTGCCTG 2074

E--> 2160 GTCTGTGCCA CGCCCGTGTG TGTGTGTGTG TGTGTGAGTG AGTGTGTGTG

2161 TGTACACTTA 2134

2163 ACCTGCTTGA GCTTCTGTGC ATGTGT 2160

2242 (2) INFORMATION FOR SEQ ID NO: 17: 
2243 (i) SEQUENCE CHARACTERISTICS:

2244 (A) LENGTH: 1952 base pairs

2245 (B) TYPE: nucleic acid

2246 (C) STRANDEDNESS: unknown
2247 (D) TOPOLOGY: unknown

2249 (ii) MOLECULE TYPE: cDNA

2251 (iii) HYPOTHETICAL: NO

C--> 2253 (iv) ANTI-SENSE: NO

2255 (v) FRAGMENT TYPE: internal

2257 (vi) ORIGINAL SOURCE:

2258 (A) ORGANISM: Mouse

2260 (ix) FEATURE:

2261 (A) NAME/KEY: CDS

2262 (B) LOCATION: 187..1692

2264 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

E--> 2266 AAGCGGCGGC AGAAGTTGCC GGCGTGGTGC TCGTAGTGAG GGCGCGGAGG
2267 ACCCGGGACC 60
E--> 2269 TGGGAAGCGG CGGCGGGTTA ACTTCGGCTG AATCACAACC ATTTGGCGCT
2270 GAGCTATGAC 120
E--> 2272 AAGAGAGCAA ACAAAAAGTT AAAGGAGCAA CCCGGCCATA AGTGAAGAGA
2273 GAAGTTTATT 180

2275 GATAAC ATG CTC TTA CGA AGC TCT GGA AAA TTA AAT GTG GGC ACC AAG 228
2276 Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys
2277 1 5 10

2279 AAG GAG GAT GGA GAG AGT ACA GCC CCC ACC CCT CGG CCC AAG ATC CTA 276
2280 Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys Ile Leu
2281 15 20 25 30

2283 CGT TGT AAA TGC CAC CAC CAC TGT CCG GAA GAC TCA GTC AAC AAT ATC 324
2284 Arg Cys Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn Ile
2285 35 40 45

2287 TGC AGC ACA GAT GGG TAC TGC TTC ACG ATG ATA GAA GAA GAT GAC TCT 372
2288 Cys Ser Thr Asp Gly Tyr Cys Phe Thr Met Ile Glu Glu Asp Asp Ser
2289 50 55 60

2291 GGA ATG CCT GTT GTC ACC TCT GGA TGT CTA GGA CTA GAA GGG TCA GAT 420
2292 Gly Met Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp
2293 65 70 75

2295 TTT CAA TGT CGT GAC ACT CCC ATT CCT CAT CAA AGA AGA TCA ATT GAA 468
2296 Phe Gln Cys Arg Asp Thr Pro Ile Pro His Gln Arg Arg Ser Ile Glu
2297 80 85 90

2299 TGC TGC ACA GAA AGG AAT GAG TGT AAT AAA GAC CTC CAC CCC ACT CTG 516
2300 Cys Cys Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu
2301 95 100 105 110

2303 CCT CCT CTC AAG GAC AGA GAT TTT GTT GAT GGG CCC ATA CAC CAC AAG 564
2304 Pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro Ile His His Lys
2305 115 120 125

2307 GCC TTG CTT ATC TCT GTG ACT GTC TGT AGT TTA CTC TTG GTC CTC ATT 612
2308 Ala Leu Leu Ile Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Ile
2309 130 135 140

2311 ATT TTA TTC TGT TAC TTC AGG TAT AAA AGA CAA GAA GCC CGA CCT CGG 660
2312 Ile Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gln Glu Ala Arg Pro Arg
2313 145 150 155

2315 


TAC 


AGC 


ATT 


GGG 


CTG 


GAG 


CAG 


GAC 


GAG 


ACA 


TAC 


ATT 


CCT 


CCT 


GGA 


GAG 


708 




2316 


Tyr 


Ser 


He 


Gly 


Leu 


Glu 


Gin 


Asp 


Glu 


Thr 


Tyr 


He 


Pro 


Pro 


Gly 


Glu' 






2317 




160 










165 










170 














2319 


TCC 


CTG 


AGA 


GAC 


TTG 


ATC 


GAG 


CAG 


TCT 


CAG 


AGC 


TCG 


GGA 


AGT 


GGA 


TCA 


756 




2320 


Ser 


Leu 


Arg 


Asp 


Leu 


He 


Glu 


Gin 


Ser 


Gin 


Ser 


Ser 


Gly 


Ser 


Gly 


Ser 
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2321 175 180 185 190 

2323 GGC CTC CCT CTG CTG GTC CAA AGG ACA ATA GCT AAG CAA ATT CAG ATG 804

2324 Gly Leu Pro Leu Leu Val Gln Arg Thr Ile Ala Lys Gln Ile Gln Met

2325 195 200 205

2327 GTG AAG CAG ATT GGA AAA GGC CGC TAT GGC GAG GTG TGG ATG GGA AAG 852

2328 Val Lys Gln Ile Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys

2329 210 215 220

2331 TGG CGT GGA GAA AAG GTG GCT GTG AAA GTG TTC TTC ACC ACG GAG GAA 900

2332 Trp Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu

2333 225 230 235

2335 GCC AGC TGG TTC CGA GAG ACT GAG ATA TAT CAG ACG GTC CTG ATG CGG 948

2336 Ala Ser Trp Phe Arg Glu Thr Glu Ile Tyr Gln Thr Val Leu Met Arg

2337 240 245 250

2339 CAT GAG AAT ATT CTG GGG TTC ATT GCT GCA GAT ATC AAA GGG ACT GGG 996

2340 His Glu Asn Ile Leu Gly Phe Ile Ala Ala Asp Ile Lys Gly Thr Gly

2341 255 260 265 270

2343 TCC TGG ACT CAG TTG TAC CTC ATC ACA GAC TAT CAT GAA AAC GGC TCC 1044

2344 Ser Trp Thr Gln Leu Tyr Leu Ile Thr Asp Tyr His Glu Asn Gly Ser

2345 275 280 285

2347 CTT TAT GAC TAT CTG AAA TCC ACC ACC TTA GAC GCA AAG TCC ATG CTG 1092

2348 Leu Tyr Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu

2349 290 295 300

2351 AAG CTA GCC TAC TCC TCT GTC AGC GGC CTA TGC CAT TTA CAC ACG GAA 1140

2352 Lys Leu Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu

2353 305 310 315

2355 ATC TTT AGC ACT CAA GGC AAG CCA GCA ATC GCC CAT CGA GAC TTG AAA 1188

2356 Ile Phe Ser Thr Gln Gly Lys Pro Ala Ile Ala His Arg Asp Leu Lys

2357 320 325 330

2359 AGT AAA AAC ATC CTG GTG AAG AAA AAT GGA ACT TGC TGC ATA GCA GAC 1236

2360 Ser Lys Asn Ile Leu Val Lys Lys Asn Gly Thr Cys Cys Ile Ala Asp

2361 335 340 345 350

2363 CTG GGC TTG GCT GTC AAG TTC ATT AGT GAC ACA AAT GAG GTT GAC ATC 1284

2364 Leu Gly Leu Ala Val Lys Phe Ile Ser Asp Thr Asn Glu Val Asp Ile

2365 355 360 365

2367 CCA CCC AAC ACC CGG GTT GGC ACC AAG CGC TAT ATG CCT CCA GAA GTG 1332

2368 Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Glu Val

2369 370 375 380

2371 CTG GAC GAG AGC TTG AAT AGA AAC CAT TTC CAG TCC TAC ATT ATG GCT 1380

2372 Leu Asp Glu Ser Leu Asn Arg Asn His Phe Gln Ser Tyr Ile Met Ala

2373 385 390 395

2375 GAC ATG TAC AGC TTT GGA CTC ATC CTC TGG GAG ATT GCA AGG AGA TGT 1428

2376 Asp Met Tyr Ser Phe Gly Leu Ile Leu Trp Glu Ile Ala Arg Arg Cys

2377 400 405 410

2379 GTT TCT GGA GGT ATA GTG GAA GAA TAC CAG CTT CCC TAT CAC GAC CTG 1476

2380 Val Ser Gly Gly Ile Val Glu Glu Tyr Gln Leu Pro Tyr His Asp Leu

2381 415 420 425 430

2383 GTG CCC AGT GAC CCT TCT TAT GAG GAC ATG AGA GAA ATT GTG TGC ATG 1524

2384 Val Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu Ile Val Cys Met

2385 435 440 445
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2387 


AAG AAG TTA CGG CCT TCA TTC CCC AAT CGA TGG AGC AGT GAT GAG TGT 1572

2388 Lys Lys Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys

2389 450 455 460

2391 CTC AGG CAG ATG GGG AAG CTT ATG ACA GAG TGC TGG GCG CAG AAT CCT 1620

2392 Leu Arg Gln Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gln Asn Pro

2393 465 470 475

2395 GCC TCC AGG CTG ACG GCC CTG AGA GTT AAG AAA ACC CTT GCC AAA ATG 1668

2396 Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met

2397 480 485 490

E--> 2399 TCA GAG TCC CAG GAC ATT AAA CTC TGACGTCAGA TACTTGTGGA CAGAGCAAGA

W--> 2400 1722

2401 Ser Glu Ser Gln Asp Ile Lys Leu

W--> 2402 495 500

E--> 2404 ATTTCACAGA AGCATCGTTA GCCCAAGCCT TGAACGTTAG CCTACTGCCC

2405 AGTGAGTTCA 1782

E--> 2407 GACTTTCCTG GAAGAGAGCA CGGTGGGCAG ACACAGAGGA ACCCAGAAAC

2408 ACGGATTCAT 1842

E--> 2410 CATGGCTTTC TGAGGAGGAG AAACTGTTTG GGTAACTTGT TCAAGATATG

2411 ATGCATGTTG 1902

2413 CTTTCTAAGA AAGCCCTGTA TTTTGAATTA CCATTTTTTT ATAAAAAAAA 1952
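
The paired nucleotide and amino acid rows above can be cross-checked against each other, which is useful when a scanned listing shows residues such as "Gin" or "lie" where "Gln" or "Ile" was meant. The following is a minimal sketch in Python, assuming the standard genetic code; the names `translate` and `mismatches` are hypothetical helpers for illustration and are not part of Checker or PatentIn.

```python
# Standard genetic code, indexed by codon with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}
CODON_TABLE = {
    a + b + c: AMINO_1[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(codons):
    """Return the three-letter residue for each codon in the list."""
    return [THREE[CODON_TABLE[codon.upper()]] for codon in codons]


def mismatches(codons, residues):
    """List (position, codon, expected, found) where a listed residue
    disagrees with the translation of its codon (1-based positions)."""
    expected = translate(codons)
    return [
        (pos, codon, exp, found)
        for pos, (codon, exp, found) in enumerate(
            zip(codons, expected, residues), start=1
        )
        if exp != found
    ]


if __name__ == "__main__":
    row = "GGC CTC CCT CTG CTG GTC CAA AGG ACA ATA GCT AAG CAA ATT CAG ATG"
    scanned = "Gly Leu Pro Leu Leu Val Gin Arg Thr lie Ala Lys Gin lie Gin Met"
    for item in mismatches(row.split(), scanned.split()):
        print(item)
```

Run on a row as scanned, the sketch reports only the positions where the printed residue and the codon disagree, so genuine content is left alone and only the transcription damage is surfaced.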








2752 (2) INFORMATION FOR SEQ ID NO: 32:

2753 (i) SEQUENCE CHARACTERISTICS:

2754 (A) LENGTH: 175 amino acids

2755 (B) TYPE: amino acid

2756 (D) TOPOLOGY: linear

2758 (ii) MOLECULE TYPE: peptide

2760 (vi) ORIGINAL SOURCE:

2761 (A) ORGANISM: Mouse

2763 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

2765 Leu Leu Glu Val Lys Ala Arg Gly Arg Phe Gly Cys Val Trp Lys Ala

2766 5 10 15

2767 Gln Leu Leu Asn Glu Tyr Val Ala Val Lys Ile Phe Pro Ile Gln Asp

2768 20 25 30

2769 Lys Gln Ser Trp Gln Asn Glu Tyr Glu Val Tyr Ser Leu Pro Gly Met

2770 35 40 45

2771 Lys His Glu Asn Ile Leu Gln Phe Ile Gly Ala Glu Lys Arg Gly Thr

2772 50 55 60

2773 Ser Val Asp Val Asp Leu Trp Leu Ile Thr Ala Phe His Glu Lys Gly

2774 65 70 75 80

2775 Ser Leu Ser Asp Phe Leu Lys Ala Asn Val Val Ser Trp Asn Glu Leu

2776 85 90 95

2777 Cys His Ile Ala Glu Thr Met Ala Arg Gly Leu Ala Tyr Leu His Glu

2778 100 105 110

2779 Asp Ile Pro Gly Leu Lys Asp Gly His Lys Pro Ala Ile Ser His Arg

2780 115 120 125

2781 Asp Ile Lys Ser Lys Asn Val Leu Leu Lys Asn Asn Leu Thr Ala Cys

2782 130 135 140

2783 Ile Ala Asp Phe Gly Leu Ala Leu Lys Phe Glu Ala Gly Lys Ser Ala

2784 145 150 155 160



1572 



1620 



1668 



1952 
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2785 


Gly Asp Thr His Gly Gln Val Gly Thr Arg Arg Tyr Met Ala Pro




2786 
































16! 


2909 (2) INFORMATION FOR SEQ ID NO: 35:

2910 (i) SEQUENCE CHARACTERISTICS:

2911 (A) LENGTH: 536 amino acids

2912 (B) TYPE: amino acid

2913 (D) TOPOLOGY: linear

2915 (ii) MOLECULE TYPE: peptide

2917 (vi) ORIGINAL SOURCE:

2918 (A) ORGANISM: MOUSE

2920 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

2923 Met Thr Ala Pro Trp Ala Ala Leu Ala Leu Leu Trp Gly Ser Leu Cys

2924 5 10 15

2926 Ala Gly Ser Gly Arg Gly Glu Ala Glu Thr Arg Glu Cys Ile Tyr Tyr

2927 20 25 30

2928 Asn Ala Asn Trp Glu Leu Glu Arg Thr Asn Gln Ser Gly Leu Glu Arg

2929 35 40 45

2930 Cys Glu Gly Glu Gln Asp Lys Arg Leu His Cys Tyr Ala Ser Trp Arg

2931 50 55 60

2932 Asn Ser Ser Gly Thr Ile Glu Leu Val Lys Lys Gly Cys Trp Leu Asp

2933 65 70 75 80

2934 Asp Phe Asn Cys Tyr Asp Arg Gln Glu Cys Val Ala Thr Glu Glu Asn

2935 85 90 95

2936 Pro Gln Val Tyr Phe Cys Cys Cys Glu Gly Asn Phe Cys Asn Glu Arg

2937 100 105 110

2938 Phe Thr His Leu Pro Glu Pro Gly Gly Pro Glu Val Thr Tyr Glu Pro

2939 115 120 125

2940 Pro Pro Thr Ala Pro Thr Leu Leu Thr Val Leu Ala Tyr Ser Leu Leu

2941 130 135 140

2942 Pro Ile Gly Gly Leu Ser Leu Ile Val Leu Leu Ala Phe Trp Met Tyr

2943 145 150 155 160

2944 Arg His Arg Lys Pro Pro Tyr Gly His Val Asp Ile His Glu Val Arg

2945 165 170 175

2946 Gln Cys Gln Arg Trp Ala Gly Arg Arg Asp Gly Cys Ala Asp Ser Phe

2947 180 185 190

2948 Lys Pro Leu Pro Phe Gln Asp Pro Gly Pro Pro Pro Pro Ser Pro Leu

2949 195 200 205

2950 Val Gly Leu Lys Pro Leu Gln Leu Leu Glu Ile Lys Ala Arg Gly Arg

2951 210 215 220

2952 Phe Gly Cys Val Trp Lys Ala Gln Leu Met Asn Asp Phe Val Ala Val

2953 225 230 235 240

2954 Lys Ile Phe Pro Leu Gln Asp Lys Gln Ser Trp Gln Ser Glu Arg Glu

2955 245 250 255

2956 Ile Phe Ser Thr Pro Gly Met Lys His Glu Asn Leu Leu Gln Phe Ile

2957 260 265 270

2958 Ala Ala Glu Lys Arg Gly Ser Asn Leu Glu Val Glu Leu Trp Leu Ile

2959 275 280 285

2960 Thr Ala Phe His Asp Lys Gly Ser Leu Thr Asp Tyr Leu Lys Gly Asn

2961 290 295 300











170 
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2962 


Ile Ile Thr Trp Asn Glu Leu Cys His Val Ala Glu Thr Met Ser Arg

2963 305 310 315 320

2964 Gly Leu Ser Tyr Leu His Glu Asp Val Pro Trp Cys Arg Gly Glu Gly

2965 325 330 335

2966 His Lys Pro Ser Ile Ala His Arg Asp Phe Lys Ser Lys Asn Val Leu

2967 340 345 350

2968 Leu Lys Ser Asp Leu Thr Ala Val Leu Ala Asp Phe Gly Leu Ala Val

2969 355 360 365

2970 Arg Phe Glu Pro Gly Lys Pro Pro Gly Asp Thr His Gly Gln Val Gly

2971 370 375 380

2972 Thr Arg Arg Tyr Met Ala Pro Glu Val Leu Glu Gly Ala Ile Asn Phe

2973 385 390 395 400

2974 Gln Arg Asp Ala Phe Leu Arg Ile Asp Met Tyr Ala Met Gly Leu Val

2975 405 410 415

2976 Leu Trp Glu Leu Val Ser Arg Cys Lys Ala Ala Asp Gly Pro Val Asp

2977 420 425 430

2978 Glu Tyr Met Leu Pro Phe Glu Glu Glu Ile Gly Gln His Pro Ser Leu

2979 435 440 445

2980 Glu Glu Leu Gln Glu Val Val Val His Lys Lys Met Arg Pro Thr Ile

2981 450 455 460

2982 Lys Asp His Trp Leu Lys His Pro Gly Leu Ala Gln Leu Cys Val Thr

2983 465 470 475 480

2984 Ile Glu Glu Cys Trp Asp His Asp Ala Glu Ala Arg Leu Ser Ala Gly

2985 485 490 495

2986 Cys Val Glu Glu Arg Val Ser Leu Ile Arg Arg Ser Val Asn Gly Thr

2987 500 505 510

2988 Thr Ser Asp Cys Leu Val Ser Leu Val Thr Ser Val Thr Asn Val Asp

E--> 2989 515 520 525

2990 Leu Leu Pro Lys Glu Ser Ser Ile
E--> 2991 530 535

3081 (2) INFORMATION FOR SEQ ID NO: 37: 

3082 (i) SEQUENCE CHARACTERISTICS: 

3083 (A) LENGTH: 97 amino acids 

3084 (B) TYPE: amino acid 

3085 (D) TOPOLOGY: linear 
3087 (ii) MOLECULE TYPE: peptide 

3089 (vi) ORIGINAL SOURCE: 

3090 (A) ORGANISM: C. elegans 

3092 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

3095 Cys His Cys Ser Arg Glu Val Gly Cys Asn Ala Arg Thr Thr Gly Trp 
E--> 3096 5 10 15 

3097 Val Pro Gly Ile Glu Phe Leu Asn Glu Thr Asp Arg Ser Phe Tyr Glu
E--> 3098 20 25 30

3099 Asn Thr Cys Tyr Thr Asp Gly Ser Cys Tyr Gln Ser Ala Arg Pro Ser
E--> 3100 35 40 45

3101 Pro Glu Ile Ser His Phe Gly Cys Met Asp Glu Lys Ser Val Thr Asp
E--> 3102 50 55 60

3103 Glu Thr Glu Phe His Asp Thr Ala Ala Lys Val Cys Thr Asn Asn Thr

E--> 3104 65 70 75 80 
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3105 Lys Asp Pro His Ala Thr Val Trp lie Cys Cys Asp Lys Gly Asn Phe 
E--> 3106 85 90 95 

3107 Cys 

3111 (2) INFORMATION FOR SEQ ID NO: 38: 

3112 (i) SEQUENCE CHARACTERISTICS: 

3113 (A) LENGTH: 6 amino acids 

3114 (B) TYPE: amino acid 

3115 (D) TOPOLOGY: linear 
3117 (ii) MOLECULE TYPE: peptide 

3119 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

3121 Asp Leu Lys Pro Glu Asn 
E--> 3122 5 

3128 (2) INFORMATION FOR SEQ ID NO: 39: 

3129 (i) SEQUENCE CHARACTERISTICS: 

3130 (A) LENGTH: 6 amino acids

3131 (B) TYPE: amino acid 

3132 (D) TOPOLOGY: linear 
3134 (ii) MOLECULE TYPE: peptide 

3136 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

3138 Asp Leu Ala Ala Arg Asn 
E--> 3139 5 

3142 (2) INFORMATION FOR SEQ ID NO: 40:

3143 (i) SEQUENCE CHARACTERISTICS:

3144 (A) LENGTH: 6 amino acids

3145 (B) TYPE: amino acid
3146 (D) TOPOLOGY: linear
3148 (ii) MOLECULE TYPE: peptide

3150 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

3152 Asp Ile Lys Ser Lys Asn
E--> 3153 5 

3156 (2) INFORMATION FOR SEQ ID NO: 41:

3157 (i) SEQUENCE CHARACTERISTICS: 

3158 (A) LENGTH: 6 amino acids 

3159 (B) TYPE: amino acid 

3160 (D) TOPOLOGY: linear 
3162 (ii) MOLECULE TYPE: peptide 

3164 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

3166 Asp Phe Lys Ser Lys Asn 
E--> 3167 5 

3170 (2) INFORMATION FOR SEQ ID NO: 42: 

3171 (i) SEQUENCE CHARACTERISTICS: 

3172 (A) LENGTH: 6 amino acids 

3173 (B) TYPE: amino acid 

3174 (D) TOPOLOGY: linear 
3176 (ii) MOLECULE TYPE: peptide 

3178 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

3180 Asp Leu Lys Ser Ser Asn 
E--> 3181 5 

3184 (2) INFORMATION FOR SEQ ID NO: 43: 
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3185 (i) SEQUENCE CHARACTERISTICS: 

3186 (A) LENGTH: 6 amino acids 

3187 (B) TYPE: amino acid 

3188 (D) TOPOLOGY: linear 
3190 (ii) MOLECULE TYPE: peptide 
3192 (ix) FEATURE:

3193 (D) OTHER INFORMATION: First Xaa is Thr or Ser; 

3194 fourth Xaa is Tyr or Phe; Each other Xaa 

3195 may be any amino acid 

3197 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

W--> 3199 Gly Xaa Xaa Xaa Xaa Xaa 
E--> 3200 5 

3203 (2) INFORMATION FOR SEQ ID NO: 44: 

3204 (i) SEQUENCE CHARACTERISTICS: 

3205 (A) LENGTH: 6 amino acids 

3206 (B) TYPE: amino acid 
3207 (D) TOPOLOGY: linear
3209 (ii) MOLECULE TYPE: peptide 

3211 (ix) FEATURE: 

3212 (D) OTHER INFORMATION: First Xaa is any amino acid;

3213 second Xaa is Ile or Val;

3214 third Xaa is Lys or Arg; 

3215 fourth Xaa is Thr or Met. 

3217 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

W--> 3219 Xaa Pro Xaa Xaa Trp Xaa 
E--> 3220 5 

3223 (2) INFORMATION FOR SEQ ID NO: 45: 

3224 (i) SEQUENCE CHARACTERISTICS: 

3225 (A) LENGTH: 6 amino acids 

3226 (B) TYPE: amino acid 

3227 (D) TOPOLOGY: linear 
3229 (ii) MOLECULE TYPE: peptide 

3231 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

3233 Gly Thr Arg Arg Tyr Met
E--> 3234 5

3236 (2) INFORMATION FOR SEQ ID NO: 46:

3237 (i) SEQUENCE CHARACTERISTICS: 

3238 (A) LENGTH: 6 amino acids 

3239 (B) TYPE: amino acid 

3240 (D) TOPOLOGY: linear 
3242 (ii) MOLECULE TYPE: peptide 

3244 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

3246 Gly Thr Ala Arg Tyr Met 
E--> 3247 5 
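
The numbering rows flagged above sit under the residues they count, with a number under every fifth amino acid. A minimal sketch of how such a row can be generated with space characters only is given below. It is written in Python, and it assumes a fixed cell of four columns per residue (three letters plus one space); that cell width and the name `numbering_line` are illustrative assumptions, not something stated in this report.

```python
def numbering_line(residues, start=1, width=4):
    """Build the numbering row for one line of three-letter residues.

    Each residue is assumed to occupy `width` columns. Every residue
    whose number is a multiple of 5 gets that number right-aligned
    under its three letters; all padding is done with spaces, never tabs.
    """
    cells = []
    for offset in range(len(residues)):
        number = start + offset
        if number % 5 == 0:
            cells.append(str(number).rjust(width - 1) + " ")
        else:
            cells.append(" " * width)
    return "".join(cells).rstrip()


if __name__ == "__main__":
    row = ["Asp", "Leu", "Lys", "Pro", "Glu", "Asn"]
    print(" ".join(row))
    print(numbering_line(row))
```

For a six-residue peptide the second printed line carries a single "5" aligned under the fifth residue, which is the shape the six-residue entries in this listing are expected to have.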
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L:27 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 

L:28 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:85 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]

L:98 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:1
M:254 Repeated in SeqNo=1

L:111 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:207 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:209 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:213 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:217 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:221 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:225 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:229 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:233 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:236 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:238 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:1

L:340 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]

L:353 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:3
M:254 Repeated in SeqNo=3

L:357 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:442 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:444 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:448 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:452 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:456 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:460 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:464 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:468 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:472 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:476 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:480 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:484 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:488 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:542 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:3

L:632 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]

L:645 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:5
M:254 Repeated in SeqNo=5

L:721 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:723 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:727 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:731 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:735 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:739 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:743 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:747 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:751 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:755 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:759 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/039,177D



DATE: 10/15/2001 
TIME: 13:47:05 



Input Set : A:\Lud-5539 sequence listing.txt 
Output Set: N:\CRF3\10152001\I039177D.raw 



L:763 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:767 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:771 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:775 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:779 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:783 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:787 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:791 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:794 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:796 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:5

L:940 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]

L:1080 M:254 E: No. of Bases conflict, Input:0 Counted:1585 SEQ:7





M:254 Repeated in SeqNo=7 

L:1210 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 
L:1223 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:9
M:254 Repeated in SeqNo=9

L:1227 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:9

L:1389 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:9

L:1479 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]
L:1492 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:11
M:254 Repeated in SeqNo=11

L:1727 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]
L:1740 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:13
M:254 Repeated in SeqNo=13

L:1750 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:13

L:1992 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]

L:2133 M:254 E: No. of Bases conflict, Input:0 Counted:1584 SEQ:15
M:254 Repeated in SeqNo=15

L:2253 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]
L:2266 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:17
M:254 Repeated in SeqNo=17 

L:2503 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 

L:2521 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 

L:2539 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 

L:2557 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 

L:2575 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 

L:2593 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 

L:2611 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:] 



L:2630 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:26

L:2786 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:32

L:2989 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:35
M:332 Repeated in SeqNo=35

L:3096 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:37
M:332 Repeated in SeqNo=37

L:3122 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:38

L:3139 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:39

L:3153 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:40

L:3167 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:41

L:3181 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:42
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VERIFICATION SUMMARY DATE: 10/15/2001 

PATENT APPLICATION: US/09/039,177D TIME: 13:47:05



Input Set : A:\Lud-5539 sequence listing.txt 
Output Set: N:\CRF3\10152001\I039177D.raw 



L:3199 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:43

L:3200 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:43

L:3219 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:44

L:3220 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:44

L:3234 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:45

L:3247 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:46
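
Each entry in the verification summary above follows the pattern "L:line M:message-code severity: text", where the severity letters seen in this report are C, E and W. A minimal sketch for tallying such entries is shown below. It is written in Python; the regular expression and the name `parse_summary` are assumptions made for illustration and are not taken from the Checker program, and the expansion of C, E and W is not stated in this report and is left uninterpreted.

```python
import re
from collections import Counter

# One summary entry: "L:<line> M:<code> <severity>: <message>".
# "M:254 Repeated in SeqNo=9" continuation lines do not match and are skipped.
ENTRY_RE = re.compile(
    r"^L:(?P<line>\d+)\s+M:(?P<code>\d+)\s+(?P<sev>[CEW]):\s*(?P<msg>.*)$"
)


def parse_summary(text):
    """Return one dict per well-formed summary entry in `text`."""
    records = []
    for raw in text.splitlines():
        match = ENTRY_RE.match(raw.strip())
        if match:
            records.append(
                {
                    "line": int(match["line"]),
                    "code": int(match["code"]),
                    "severity": match["sev"],
                    "message": match["msg"].strip(),
                }
            )
    return records


SAMPLE = """\
L:1210 M:220 C: Keyword misspelled or invalid format, [(iv) ANTI-SENSE:]
L:1223 M:254 E: No. of Bases conflict, Input:0 Counted:50 SEQ:9
M:254 Repeated in SeqNo=9
L:1227 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:9
"""

records = parse_summary(SAMPLE)
by_severity = Counter(record["severity"] for record in records)

if __name__ == "__main__":
    for severity, count in sorted(by_severity.items()):
        print(severity, count)
```

Grouping by message code instead of severity (for example `Counter(r["code"] for r in records)`) gives a quick view of which rule accounts for most of the entries.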
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NOTICE TO COMPLY WITH REQUIREMENTS FOR PATENT APPLICATIONS CONTAINING 
NUCLEOTIDE SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

The nucleotide and/or amino acid sequence disclosure contained in this application does not 
comply with the requirements for such a disclosure as set forth in 37 C.F.R. 1.821 - 1.825 for the 
following reason(s): 



□ 
□ 

a 

□ 

□ 
□ 



1. This application clearly fails to comply with the requirements of 37 C.F.R. 1.821-1.825. Applicant's 
attention is directed to the final rulemaking notice published at 55 FR 18230 (May 1, 1990), and 1114 
OG 29 (May 15, 1990). If the effective filing date is on or after July 1, 1998, see the final rulemaking 
notice published at 63 FR 29620 (June 1, 1998) and 1211 OG 82 (June 23, 1998). 

2. This application does not contain, as a separate part of the disclosure on paper copy, a "Sequence 
Listing" as required by 37 C.F.R. 1.821(c). 

3. A copy of the "Sequence Listing" in computer readable form has not been submitted as required by 
37 C.F.R. 1.821(e). 

4. A copy of the "Sequence Listing" in computer readable form has been submitted. However, the 
content of the computer readable form does not comply with the requirements of 37 C.F.R. 1.822 
and/or 1.823, as indicated on the attached copy of the marked-up "Raw Sequence Listing." 

5. The computer readable form that has been filed with this application has been found to be damaged 
and/or unreadable as indicated on the attached CRF Diskette Problem Report. A Substitute 
computer readable form must be submitted as required by 37 C.F.R. 1.825(d). 

6. The paper copy of the "Sequence Listing" is not the same as the computer readable form of the 
"Sequence Listing" as required by 37 C.F.R. 1.821(e). 

7. Other: 



Applicant Must Provide: 



a 



An initial or substitute computer readable form (CRF) copy of the "Sequence Listing". 



□ An initial or substitute paper copy of the "Sequence Listing", as well as an amendment directing its entry 
into the specification. 

A statement that the content of the paper and computer readable copies are the same and, where 
applicable, include no new matter, as required by 37 C.F.R. 1.821(e) or 1.821(f) or 1.821(g) or 
1.825(b) or 1.825(d). 

For questions regarding compliance to these requirements, please contact: 

For Rules Interpretation, call (703) 308-4216 
For CRF Submission Help, call (703) 308-4212 
PatentIn Software Program Support (SIRA) 

Technical Assistance 703-287-0200 

To Purchase PatentIn Software 703-306-2600 



PLEASE RETURN A COPY OF THIS NOTICE WITH YOUR RESPONSE 
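
Several of the format problems that lead to a notice of this kind can be screened before a computer readable form is submitted: lines longer than 72 characters (white space included), tab codes used in place of space characters, and text not saved as ASCII. The following is a minimal sketch of such a pre-check in Python. The name `check_lines` is a hypothetical helper, and the sketch is not a substitute for the USPTO Checker program.

```python
def check_lines(text, max_len=72):
    """Return (line_number, problem) pairs for basic format problems.

    Checks three things per line: length over `max_len` characters
    (white space included), presence of a tab, and any non-ASCII character.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > max_len:
            problems.append((number, f"line exceeds {max_len} characters"))
        if "\t" in line:
            problems.append((number, "tab character used"))
        if not line.isascii():
            problems.append((number, "non-ASCII character"))
    return problems


if __name__ == "__main__":
    sample = "Asp Leu Lys Pro Glu Asn\n\t\t5\n"
    for number, problem in check_lines(sample):
        print(f"line {number}: {problem}")
```

A file that passes this screen can still fail on content rules (keywords, base counts, residue numbering), so the sketch only narrows down the purely mechanical causes.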



ERROR DETECTED 



RECEIVED 

NOV 13 2001 
TECH CENTER 1600/2900 

SUGGESTED CORRECTION . SERIAL WUMDER: 0^1 jo ^^ 

ATTN: NEW RULES CASES: fLEASE DISRECARJ) ENCLISH "ALPHA" HEADERS. WHICH WERE INSERTED D V T 

1 [/vJckocKd Nucleic* The numbcrAcxt at the end of each line "wrapped"* down to the nexl line. Jnu may occur if your file 

Wrapped Aminos wit retrieved in a word processor nflcr creating H. Plcxse adjust your right mar gin to J; this will 
prevent **wrapping," 

* 2 Invalid Line Length The rulci require that a line not c xceed 72 characters in length.' this includes white spaces. 

7 ~f • / 

3 k^ MUalitned Amino The numbering under each 5* amino acid ii misaligned. Do not use Ub codes between numbers; 

Numbering use apace characters, instead 

4. Non-ASCII: The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please ensure your subsequent submission is saved in ASCII text.
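One way to confirm that a file is plain ASCII is to scan its raw bytes for values above 0x7F. The sketch below assumes the file contents have already been read in binary mode; `find_non_ascii` is a hypothetical name.

```python
def find_non_ascii(data):
    """Return (line_number, column, byte_value) for each byte above 0x7F."""
    hits = []
    for line_number, line in enumerate(data.split(b"\n"), start=1):
        for column, byte in enumerate(line, start=1):
            if byte > 0x7F:
                hits.append((line_number, column, byte))
    return hits
```

Files saved in a word processor's native format typically produce many hits, while a file saved as plain text should produce none.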


5. Variable Length: Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, each n or Xaa can only represent a single residue. Please present the maximum number of each residue having variable length and indicate in the <220>-<223> section that some may be missing.

6. PatentIn 2.0 "bug": A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid sequence(s). Normally, PatentIn would automatically generate this section from the previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for Artificial or Unknown sequences.

7. Skipped Sequences (OLD RULES): Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
This sequence is intentionally skipped

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8. Skipped Sequences (NEW RULES): Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

<210> sequence id number
<400> sequence id number
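Gaps in the sequence numbering can be found by collecting the ids on the <210> lines. The sketch below assumes each sequence begins with a <210> line at the start of a line; both function names are hypothetical, and the placeholder output reproduces only the two lines shown in this notice.

```python
import re


def missing_sequence_ids(text):
    """Return ids absent from the <210> lines, up to the highest id seen."""
    seen = {int(m.group(1)) for m in re.finditer(r"^<210>\s*(\d+)", text, re.MULTILINE)}
    if not seen:
        return []
    return [i for i in range(1, max(seen) + 1) if i not in seen]


def skipped_sequence_lines(seq_id):
    """Build the two placeholder lines shown in the notice for one skipped sequence."""
    return [f"<210> {seq_id}", f"<400> {seq_id}"]
```

For each id returned by `missing_sequence_ids`, the lines from `skipped_sequence_lines` would be inserted at the matching position in the listing, provided the skip is intentional.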


9. Use of n's or Xaa's (NEW RULES): Use of n's and/or Xaa's has been detected in the Sequence Listing. Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. In the <220> to <223> section, please explain the location of n or Xaa, and which residue n or Xaa represents.
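This rule can be approximated with a simple scan. The sketch below rests on several assumptions that the notice does not state: each sequence starts at a <210> line, residues follow the <400> line, nucleotides are written in lowercase groups, and amino acids use three-letter codes such as Xaa. All function names are hypothetical.

```python
import re


def split_sequences(text):
    """Split a listing into {sequence id: block text}, keyed on <210> lines."""
    blocks = {}
    current = None
    for line in text.splitlines():
        match = re.match(r"<210>\s*(\d+)", line)
        if match:
            current = int(match.group(1))
            blocks[current] = []
        if current is not None:
            blocks[current].append(line)
    return {seq_id: "\n".join(lines) for seq_id, lines in blocks.items()}


def uses_n_or_xaa(block):
    """True if the residues after the <400> line include n or Xaa."""
    _, found, residues = block.partition("<400>")
    if not found:
        return False
    for token in residues.split():
        if token == "Xaa":
            return True
        # Assumed: an all-lowercase alphabetic token is a nucleotide group.
        if token.isalpha() and token.islower() and "n" in token:
            return True
    return False


def sequences_missing_feature(text):
    """Ids that use n or Xaa but have no <220> and <223> entries."""
    flagged = []
    for seq_id, block in split_sequences(text).items():
        has_feature = "<220>" in block and "<223>" in block
        if uses_n_or_xaa(block) and not has_feature:
            flagged.append(seq_id)
    return sorted(flagged)
```

The check only confirms that a <220>-<223> section exists. Whether it actually explains the location and meaning of each n or Xaa must still be reviewed by hand.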

10. Invalid <213> Response: Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or scientific name (Genus/species). A <220>-<223> section is required when the <213> response is Unknown or Artificial Sequence.
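A rough screen for <213> responses is sketched below. The two fixed responses come from the notice. The pattern for a scientific name (capitalized genus followed by a lowercase species) is only a heuristic assumption and cannot confirm that a name is real. The function names are hypothetical.

```python
import re

FIXED_RESPONSES = {"Unknown", "Artificial Sequence"}
# Assumed shape of a scientific name: capitalized genus, lowercase species.
GENUS_SPECIES = re.compile(r"^[A-Z][a-z]+ [a-z][a-z.-]*$")


def is_valid_organism(response):
    """True if the response is a fixed value or looks like Genus species."""
    response = response.strip()
    return response in FIXED_RESPONSES or bool(GENUS_SPECIES.match(response))


def invalid_organism_lines(text):
    """Return (line_number, response) for each <213> line that fails the screen."""
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        match = re.match(r"<213>\s*(.*)", line)
        if match and not is_valid_organism(match.group(1)):
            problems.append((number, match.group(1).strip()))
    return problems
```

Common names such as "human" or "mouse" are flagged, which matches the kind of response this item describes as invalid.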

11. Use of <220>: Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. Use of <220> to <223> is MANDATORY if the <213> "Organism" response is "Artificial Sequence" or "Unknown." Please explain the source of genetic material in the <220> to <223> section. (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)
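The dependency between <213> and <220>-<223> can be checked per sequence. This sketch assumes every numeric identifier starts at the beginning of a line and each sequence begins with <210>; `sequences_needing_feature` is a hypothetical name.

```python
import re

NEEDS_FEATURE = {"Artificial Sequence", "Unknown"}
# One match per sequence: from a <210> line up to the next <210> line or end of text.
SEQUENCE_BLOCK = re.compile(r"^<210>\s*(\d+).*?(?=^<210>|\Z)", re.MULTILINE | re.DOTALL)


def sequences_needing_feature(text):
    """Ids whose <213> is Artificial Sequence or Unknown but which lack <220>/<223>."""
    flagged = []
    for match in SEQUENCE_BLOCK.finditer(text):
        seq_id, block = int(match.group(1)), match.group(0)
        organism = re.search(r"^<213>\s*(.+?)\s*$", block, re.MULTILINE)
        has_feature = (re.search(r"^<220>", block, re.MULTILINE)
                       and re.search(r"^<223>", block, re.MULTILINE))
        if organism and organism.group(1) in NEEDS_FEATURE and not has_feature:
            flagged.append(seq_id)
    return flagged
```

As with the other sketches, a passing result shows only that the section is present, not that it adequately explains the source of the genetic material.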

12. PatentIn 2.0 "bug": Please do not use the "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). Instead, please use "File Manager" or any other manual means to copy the file to floppy disk.


13. Misuse of n: n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent any value not specifically a nucleotide.

AMC/MH - Biotechnology Systems Branch - 08/21/2001





